Add root folder onto URL Using Regex

Add root folder onto URL Using Regex - php

I'm trying to transform any URL in PHP and add a root folder onto it using regex.
Before:
http://domainNamehere.com/event/test-event-in-the-future/
After:
http://domainNamehere.com/es/event/test-event-in-the-future/
Any ideas?

A simple solution without regex:
$root_folder = 'es';
$url = "http://domainNamehere.com/event/test-event-in-the-future/";
$p = strpos($url, '/', 8);
$url_new = sprintf('%s/%s/%s', substr($url, 0, $p), $root_folder, substr($url, $p+1));
EDIT: JavaScript solution is pretty much the same:
var root_folder = 'es';
var url = "http://domainNamehere.com/event/test-event-in-the-future/";
var p = url.indexOf('/', 8);
var url_new = url.substring(0,p) + '/' + root_folder + url.substring(p);
Of course, for a live application you should also check if p actually gets a valid value assigned (that means if the slash is found) because you might have an invalid url in your input or empty string.

$url = 'http://domainNamehere.com/event/test-event-in-the-future/'; //or the function you use to grab it;
$url = str_replace('domainNamehere.com','domainNamehere.com/es', $url);
quite dirty but effective and without regex, assuming your "es" folder is always in that position (I think so)

If the domain name is always the same you can use:
$string = str_replace('domainNamehere.com', 'domainNamehere.com/es', $url);

Untested:
$url = preg_replace('#(?<=^[a-z]+://[^/]+/)#i', "es/", $url);
Use '#' to delimit the regular expression so the slashes don't have to be escaped.
(?<=...) searches for a match for [a-z]://[^/]+/ without including it in the matched string.
[a-z]+://[^/]/ matches a sequence of letters followed by :// followed by non-slashes, then a slash. This will handle all web protocols, particularly http and https.
The little i makes the search case-insensitive.
The replacement just inserts es/ after the match.

This is the most succinct way that I could think of how to do it.
$new_url = preg_replace('#(?<=\w)(?=/)#', '/en', $url, 1);
It will insert whatever you put in the 2nd parameter into the string just before the first slash that also has a proceeding alphanumeric character.
Tested with PHP 5.3.6

Related

PHP file get content - only specific part of the text

Hi just need to solve this php issue
What I need is to get only specific part of the text that I get from remote server. So remote server gives me this kind of content
......":"http://xxxxxxxx.com?source=yyyyy&service=zzzzz&token=47baiufbweiwrwrqwr21","...........
And I need to get back only this part: token=47baiufbweiwrwrqwr21
thank for help

Use parse_url() and preg_match()
$url = 'http://xxxxxxxx.com?source=yyyyy&service=zzzzz&token=47baiufbweiwrwrqwr21';
$query = parse_url($url, PHP_URL_QUERY);
preg_match('/token=([a-z0-9]+)/i', $query, $token);
echo $token[1];
parse_url() returns the Components of an URL.
You want to have a specific Query-Parameter, so you use the PHP_URL_QUERY-Component. After this you extract the token with a Regular Expression that searches for the Parameter-Name "token" followed by a "=". Then you catch all characters you want (a-z and 0-9). You set the i-Delimiter for a caseinsensitive Search. $token[1] returns the Match within the Subpattern (Brackets).
And if you get the URL from a File you can read it into a String using file_get_contents().

You can use substr and stripos
This will make sure you get the correct part even if the variables are in a different order and is not case sensitive.
https://3v4l.org/q35f0
$url = 'http://xxxxxxxx.com?source=yyyyy&token=47baiufbweiwrwrqwr21&service=zzzzz';
$pos = stripos($url, "token");
$pos2 = stripos($url, "&", $pos);
If($pos2 > 0){
Echo substr($url, $pos, $pos2-$pos);
}Else{
Echo substr($url, $pos);
}
Edit sorry my phone autocorrected my stripos to strpos. Funny how my phone is starting to learn php.

Match regex i url

I am trying to strip out a page query, in the middle of building pagination on a page.
The url is
$url = www.example.com?category=computers&page=10
Regex
$cleanUrl = preg_replace('/page=[0-9,]+&/', '', $url);
This isn't finding a match though?

Your regex is failing because you are searching for an ampersand after the page parameter with its value.
You can make the ampersand optional by appending a question mark after it.
$url = 'www.example.com?category=computers&page=10';
$cleanUrl = preg_replace('/page=[0-9,]+&?/', '', $url);
Example, https://regex101.com/r/uW8uV8/1
For a more through regex try out:
<?php
$url = 'www.example.com?&category=cat&page=10';
$cleanUrl = rtrim(preg_replace('/(&|\?)page=[0-9,]+&?/', '$1', $url), '&?');
echo $cleanUrl;
This requires the rtrim function though to remove the trailing parameter glue. This will account for...
Page by itself, www.example.com?page=10
Page as the last parameter, www.example.com?species=wolf&page=10
Page as the first parameter, www.example.com?page=10&species=wolf
Page as a middle parameter, www.example.com?cat=tom&mouse=jerry&page=10&species=wolf

You should start and end your pattern with ~. This will work:
preg_replace('~&page=[0-9]+~', '', 'www.example.com?category=computers&page=10');

Get vine video id using php

I need to get the vine video id from the url
so the output from link like this
https://vine.co/v/bXidIgMnIPJ
be like this
bXidIgMnIPJ
I tried to use code form other question here for Vimeo (NOT VINE)
Get img thumbnails from Vimeo?
This what I tried to use but I did not succeed
$url = 'https://vine.co/v/bXidIgMnIPJ';
preg_replace('~^https://(?:www\.)?vine\.co/(?:clip:)?(\d+)~','$1',$url)

basename maybe?
<?php
$url = 'https://vine.co/v/bXidIgMnIPJ';
var_dump(basename($url));
http://codepad.org/vZiFP27y

Assuming it will always be in that format, you can just split the url by the / delimiter. Regex is not needed for a simple url such as this.
$id = end(explode('/', $url));

Referring to as the question is asked here is a solution for preg_replace:
$s = 'https://vine.co/v/bXidIgMnIPJ';
$new_s = preg_replace('/^.*\//','',$s);
echo $new_s;
// => bXidIgMnIPJ
or if you need to validate that an input string is indeed a link to vine.co :
$new_s = preg_replace('/^(https?:\/\/)?(www\.)?vine\.co.*\//','',$s);
I don't know if that /v/ part is always present or is it always v... if it is then it may also be added to regex for stricter validation:
$new_s = preg_replace('/^(https?:\/\/)?(www\.)?vine\.co\/v\//','',$s);

Here's what I am using:
function getVineId($url) {
preg_match("#(?<=vine.co/v/)[0-9A-Za-z]+#", $url, $matches);
if (isset($matches[0])) {
return $matches[0];
}
return false;
}
I used a look-behind to ensure "vine.co/v/" always precedes the ID, while ignoring if the url is HTTP or HTTPS (or if it lacks a protocol altogether). It assumes the ID is alphanumeric, of any length. It will ignore any characters or parameters after the id (like Google campaign tracking parameters, etc).
I used the "#" delimiter so I wouldn't have to escape the forward slashes (/), for a cleaner look.

explode the string with '/' and the last string is what you are looking for :) Code:
$vars = explode("/",$url);
echo $vars[count($vars)-1];

$url = 'https://vine.co/v/b2PFre2auF5';
$regex = '/^http(?:s?):\/\/(?:www\.)?vine\.co\/v\/([a-zA-Z0-9]{1,13})$/';
preg_match($regex,$url,$m);
print_r($m);
1. b2PFre2auF5

Trimming characters off a string

Given the following URL:
http://www.domain.com/reporting/category-breakdown.php?re=updated
I need to remove everything after the .php
It might be "?re=updated" or it could be something else. The number of characters won't always be the same, the string will always end with .php though.
How do I do this?

To find the first position of a substring in a string you can use strpos() in PHP.
$mystring = 'http://www.domain.com/reporting/category-breakdown.php?re=updated';
$findme = '.php';
$pos = strpos($mystring, $findme);
After, you have the position of the first character of your substring '.php' in your URL. You want to get the URL until the end of '.php', that means the position you get + 4 (substring length). To get this, you can use substr(string,start,length) function.
substr($mystring, 0, $pos + 4);
Here you are!

Find the first indexOf (".php"), then use substring from char 0 to your index + the length of (".php");

3 line solution:
$str = "http://www.domain.com/reporting/category-breakdown.php?re=updated";
$str = array_shift(explode('?', $str));
echo $str;
Note: it's not fool-proof and could fail in several cases, but for the kind of URLs you mentioned, this works.

Here is another way to get the non-query-string part of a url with PHP:
$url = 'http://www.domain.com/reporting/category-breakdown.php?re=updated';
$parsed = parse_url($url);
$no_query_string = $parsed['scheme'] . '://' . $parsed['hostname'] . $parsed['path'];
// scheme: http, hostname: www.domain.com, path: /reporting/category-breakdown.php
That will handle .php, .phtml, .htm, .html, .aspx, etc etc.
Link to Manual page.

Strange behavior of preg_replace

I want to switch the protocol of a link. If it is http, it should become https, and https should become http. I'm using pre_replace but something is going wrong.
Could someone look at my code and tell me what I am missing in my thinking process?
Here is the code:
$pattern = array(
0 => '/^(http\:)/',
1 => '/^(https\:)/'
);
$replace = array(
0 => 'https:',
1 => 'http:'
);
ksort($pattern);
ksort($replace);
$url = 'http://someurl.com';
echo $url."<br />";
$url = preg_replace($pattern, $replace, trim($url),1);
die($url);

You do not need to escape :, it is not a special character.
You don't need a capture group ().
You don't need to call ksort(), your arrays are already sorted by key when you declare them.
You appear to have your code replacing 'http' with 'https' AND replacing 'https' with 'http'. Why?
$url = preg_replace('/^http:/', 'https', trim($url)); will work just fine if you're simply looking to force to https.
edit
I still don't know why anyone would want to switch both http/https concurrently, but here you go:
function protocol_switcheroo($url) {
if( preg_match('/^http:/', $url) ) {
return preg_replace('/^http:/', 'https:', $url); // http to https
} else if( preg_match('/^https:/', $url) ) {
return preg_replace('/^https:/', 'http:', $url); // https to http
} else {
return $url; // for URIs with protocols other than http/https
}
}
You need to separate out the calls to replace so that you do not accidentally chain them like in the original code in the question.

The reason this isn't working for http -> https (but does work for https -> http) is that preg_replace() first changes the http to https with the first set of key/variable (0), but then immediately back to https -> http, because then the second set of variables (1) in each array is another valid match.

//$url = 'http://example.com/https://www';
$url = 'https://example.com/http://www';
$url = (0 === strpos($url, 'http:'))
? substr_replace($url, 's:', 4, 1)
: substr_replace($url, '', 4, 1);
echo $url;
This will convert HTTP -> HTTPS and HTTPS -> HTTP
It does not use a regex which would be slower, and does not use str_replace() which can inadvertently replace other portions of the URL. It will only replace the first prefix.
Breakdown : it looks to see if the URL begins with http: is it does it will replace the 5th character : with s: making it HTTPS. Otherwise it will replace the 5th character s with nothing making it HTTP.

Your URL is getting replaced twice. First, first expression matches and http://someurl.com becomes https://someurl.com. Then, second expression matches and https://someurl.com becomes http://someurl.com.
It's easier to see with this other example:
echo preg_replace(
array('/fox/', '/turtle/'),
array('turtle', 'sparrow'),
'fox', 1);
... that prints sparrow.

The problem you have is that preg_replace() does the two replacements one after the other, so that after the first one has run, the second one reverses the effect of the first one.
You need to specify both patterns in a single expression in order to have them run together.
I suggest using preg_replace_callback() instead of preg_replace(). With this, you can write a more complex output expression, which makes it easier to combine them into a single pattern. Something like this will do the trick:
$outputString = preg_replace_callback(
'/^(http|ftp)(s)?(:)/',
function($matches) {return $matches[1].($matches[2]=='s'?'':'s').':';},
$inputString
);
Hope that helps.
[EDIT] Edited the code so it works for ftp/ftps as well as http/https, after comment by OP.

We Keep Coding

PHP, A popular general-purpose scripting language that is especially suited to web development.

Add root folder onto URL Using Regex - php

I'm trying to transform any URL in PHP and add a root folder onto it using regex. Before: http://domainNamehere.com/event/test-event-in-the-future/ After: http://domainNamehere.com/es/event/test-event-in-the-future/ Any ideas?

$url = 'http://domainNamehere.com/event/test-event-in-the-future/'; //or the function you use to grab it; $url = str_replace('domainNamehere.com','domainNamehere.com/es', $url); quite dirty but effective and without regex, assuming your "es" folder is always in that position (I think so)

If the domain name is always the same you can use: $string = str_replace('domainNamehere.com', 'domainNamehere.com/es', $url);

This is the most succinct way that I could think of how to do it. $new_url = preg_replace('#(?<=\w)(?=/)#', '/en', $url, 1); It will insert whatever you put in the 2nd parameter into the string just before the first slash that also has a proceeding alphanumeric character. Tested with PHP 5.3.6

Related

PHP file get content - only specific part of the text

Match regex i url

Get vine video id using php

Trimming characters off a string

Strange behavior of preg_replace

Categories

Resources