Internal page - don't want it crawled - php

Please indulge my simplistic example:
Let's say I have 1 million pages on my site. Each page has an ID number (i.e. www.mysite.com/product.php?id=1 .... upto id=1,000,000
I want to make a page that allows users to email themselves the link to the page (or email a friend). This page is also dynamic and comes from the product.php, so like this:
www.mysite.com/email_a_friend.php?id=1 .... upto id=1,000,000
Imagine I do this and now google wants to crawl 2 million pages instead of 1 million. This seems pointless since the page doesn't have anything of any value on the email page. Also, consider that the email page would actually be a pop-up window on the product.php page.
What is the best way to handle this?
Should I block it (email_a_friend.php) through a robots.txt file?
Or should I perhaps used java or something else for the link that google would ignore anyway?
Thanks!!

If I'm understanding your question correctly, you have an e-commerce site and want to implement a share button for a particular product. What you don't want to happen is have google index the page that the share button goes to.
If that's the case, just add a rel="nofollow" to the link in your php that generates the link and you should have no problem.
Here is a google help article that explains how google does not follow links with the nofollow attribute: http://support.google.com/webmasters/bin/answer.py?hl=en&answer=96569

Related

Get content from another site with no limit

I have permission to use the table with a link to its inclusion on its website, where the name will be redirected to the landing page. However, the website owner is not able to say whether and how to modify the address of the page to display all the records in the table at once without paging.
It is even possible? If so, what you need to use GET?
http://www.newconnect.pl/?page=1045&id_firm=&id_tr=&ncc_index=ESC&ph_main_content_order=&ph_main_content_order_type=&ph_main_content_offset=0
Currently on one page displays only 20 records.
First of all ask site owner if it has some API to share data. It would be much easier.
If not you always can write a crawler and move from page to page, to get data. I strongly recommend to use Simple HTML DOM Parser to get website content.

Indexing a link under an if statement?

will index Google any links witch are under an if statement?
Ex
<a"; if($user_category=="register2"){ $link="href='".$databaselink; echo $link;}"> BOOK</a>
The links are pdf on my server.
WIll that link be seen by crawl and will be indexed?
Mention that this content is displayed without beein log in. is public. after log in user get a category wich will display the links.
I didn't find your question or code all that clear, however here's what I think you're after.
Google will take whatever the link is at the time it crawls the page.So whatever your default link is, the crawler will take.
The crawler will not see links which require a user to login.

Hide Javascript file from Analytics

I apologize ahead of time for the non descriptive title, as I wasn't really sure how to word this.
I've currently switched some of my Wordpress sites that have a responsive design that implement a slider over to WooSlider. Works super well, and I love it. However, there is something stopping me from switching all of my sites over. And I understand this is not a WooSlider only fault, but it's something I cannot Google and find out.
This is happening on every page view, even those without a slider.
In Google Analytics it shows domain.com/?wooslider-javascript=load&t=1352743207&ver=1.0.0 as a page view. For every single page. I obviously don't want this, but I don't know how to get rid of it.
Another example of this happening is using Gravity Forms with a referrer info plugin that shows page views, search query, browser, etc.
When the form is sent, the following is sent via email.
Page visited 1: domain.com/?wooslider-javascript=load&t=1352743207&ver=1.0.0 (http://domain.com/?wooslider-javascript=load&t=1352743207&ver=1.0.0)
Page visited 2: domain.com/about (http://domain.com/contact/about/
Page visited 3: domain.com/?wooslider-javascript=load&t=1352751787&ver=1.0.0 (http://domain.com/?wooslider-javascript=load&t=1352751787&ver=1.0.0)
Page visited 4: domain.com/contact/ (http://domain.com/contact/)
So obviously I don't want that js file to show up as a page view. How can I remedy this?
Thanks!
Google Analytics Configuration Mistake #2: Query String Variables
wooslider-javascript,t,ver

Redirecting outbound links and rel="nofollow" attribute - what is the difference?

As far as I know many websites add rel="nofollow" attribute to all outbound links inside their forum's posts. As I understand, that way they tell search robots not to use those links for ranking webpages. Also I've noticed that some forums use inside redirect (I'm not sure if this is the right term though) for outgoing links. Let's say the forum url is http://someforum.com. So when I post with a link
Hi this is [url="http://mysite.com"]my site[/url]
The link transforms to something like this
Hi this is my site
I suspect that the meaning of this is the same as adding rel="nofollow" atttribute.
Am I wright? If yes, is there any sense in using this kind of redirection and why not just use a rel="nofollow" attribute instead?
This kind of redirecting is used for several reasons. Here are some I am aware of:
tracking outgoing traffic leaving the own site
displaying a warning page that the user is leaving the site now with the ability to cancel within a few seconds and go back
The 2nd point gives you a chance to keep traffic on your site. And there may be legal reasons in countries like Germany here. In Germany you are responsible even for content when it is not your own but you are linking to it. So in Germany you must check the linked content on a regular basis and warn users that the linked content is not under your control. This can be done on such an extra redirect page.
I am not a lawyer but this is one of the most discussed internet-related legal issues here.
How the redirection is done will determine if ranking juice is past to the recipient.
A 301 Redirect will work almost like a direct link, with a little loss of ranking in the process.

Redirect Search results from google to a specific page

HI,
I have a specific requirement, when a user searches in google.com which returns me a list of results and one of the results is my website but it is pointing to a different page.
I need help in 2 places,
Client is planing to enhance my website so that what ever link of my website appears in google results should show as cars.com/search.jsp?search=audi. I will be having the logic in my search.jsp or search.php page which navigates the user to an appropriate page.
Or will the use the http_referrer stuff in all the pages of my website to redirect to different pages helps me in this case ?
Is this possible to implement, please guide me regarding this.
Thanks
Gopi
every link of my website appears in google results should show as cars.com/search.jsp?search=audi. I will be having the logic in my search.jsp or search.php page which navigates the user to an appropriate page.
Hum? The only way to do this is to have Google index exactly one page – that is, cars.com/search.jsp?search=audi. You probably don't want that.
Or will the use the http_referrer stuff in all the pages of my website to redirect to different pages helps me in this case ?
This is a possibility, but beware that if show pages that are significantly different than what Google indexes, you'll be violating their terms and risk yourself to being removed from Google's index.

Categories