Revision [6025]

This is an old revision of RobotFriendly made by DotMG on 2005-02-16 07:50:08.

 

To keep SearchEngines' spiders away from non-content pages, most of WikkaWiki's WikiEdit, PageHistoryInfo, etc. pages include a robots meta tag that tells spiders not to index them. This keeps the SearchEngines' databases cleaner, at least for your website. This page aims to help make your website friendlier to robots.

To start, you probably want to add deleted (nonexistent) pages to the set of pages that get the robots meta tag. The patch below does this by adding a !$this->page check to the existing condition:

diff -ur wiki.orig/actions/header.php wiki/actions/header.php
--- wiki.orig/actions/header.php        Tue Feb 15 21:47:56 2005
+++ wiki/actions/header.php     Tue Feb 15 21:51:43 2005
@@ -9,7 +9,7 @@
 <head>
        <title><?php echo $this->GetWakkaName().": ".$this->PageTitle(); ?></title>
        <base href="<?php echo $site_base ?>" />
-       <?php if ($this->GetMethod() != 'show' || $this->page["latest"] == 'N' || $this->page["tag"] == 'SandBox') echo "<meta name=\"robots\" content=\"noindex, nofollow, noarchive\" />\n"; ?>
+       <?php if ($this->GetMethod() != 'show' || !$this->page || $this->page["latest"] == 'N' || $this->page["tag"] == 'SandBox') echo "<meta name=\"robots\" content=\"noindex, nofollow, noarchive\" />\n"; ?>
        <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" />
        <meta name="keywords" content="<?php echo $this->GetConfigValue("meta_keywords") ?>" />
        <meta name="description" content="<?php echo $this->GetConfigValue("meta_description") ?>" />


To install a patch, place it in a file in your wiki's directory and execute: patch -p1 < filename
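
The condition evaluated by the patched header.php line can be read as a small predicate. The sketch below is only an illustration: the function name needs_robots_noindex and its two parameters are hypothetical and not part of WikkaWiki, and it assumes $page is the page row array, or null/empty when the page does not exist.

```php
<?php
// Hypothetical helper mirroring the condition in the patched header.php line.
// $method: name of the current handler ('show', 'edit', ...).
// $page:   page row as an array, or null/empty if the page does not exist.
function needs_robots_noindex($method, $page)
{
    if ($method != 'show') {
        return true;   // edit, history and other non-content handlers
    }
    if (!$page) {
        return true;   // nonexistent page: the case added by the patch
    }
    if ($page['latest'] == 'N') {
        return true;   // old revision of a page
    }
    return $page['tag'] == 'SandBox';
}
```

Checking !$page before reading $page['latest'] matters: it matches the order in the patched line and avoids indexing into a missing page.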

Sending a 404 response for a nonexistent page
Modify ./handlers/page/show.php like this:
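if (!$this->page)
    {
        // Reuse the protocol version of the request; fall back to HTTP/1.1 if it is not set.
        $httpversion = isset($_SERVER["SERVER_PROTOCOL"]) ? $_SERVER["SERVER_PROTOCOL"] : 'HTTP/1.1';
        // Send the 404 status line, then the usual "create this page" message.
        header("$httpversion 404 Not Found");
        print("<p>This page doesn't exist yet. Maybe you want to <a href=\"".$this->Href("edit")."\">create</a> it?</p></div>");
    }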
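
To make the status line logic easier to check in isolation, it can be pulled into a small function. This is a sketch only: not_found_status_line is a hypothetical name, and it takes a $_SERVER-like array as a parameter so it can be exercised without a web server.

```php
<?php
// Hypothetical helper: build the 404 status line from a $_SERVER-like array.
function not_found_status_line(array $server)
{
    // Same fallback as in the snippet above: default to HTTP/1.1.
    $httpversion = isset($server['SERVER_PROTOCOL']) ? $server['SERVER_PROTOCOL'] : 'HTTP/1.1';
    return "$httpversion 404 Not Found";
}
```

In the handler it would be used as header(not_found_status_line($_SERVER)); before the message is printed.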


Note: Internet Explorer requires an error response body to reach a minimum number of bytes (512 bytes for a 404 with the default settings); if the body is shorter than this limit, IE displays its own default content instead. Otherwise the page should display the content we expect (This page doesn't exist yet. Maybe you want to create it?).
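
If the short 404 body does trigger IE's replacement page, one possible workaround is to pad the body with an HTML comment until it reaches the minimum length. The sketch below is an assumption, not part of WikkaWiki: pad_error_body is a hypothetical helper, and the 512-byte default is an assumed threshold that you may need to adjust.

```php
<?php
// Hypothetical helper: pad an HTML body with a trailing HTML comment so that
// it is at least $minBytes long. The 512 default is an assumed threshold.
function pad_error_body($html, $minBytes = 512)
{
    $missing = $minBytes - strlen($html);   // strlen() counts bytes
    if ($missing <= 0) {
        return $html;                       // already long enough
    }
    // "<!--" and "-->" add 7 bytes themselves; fill the rest with spaces.
    $filler = str_repeat(' ', max(0, $missing - 7));
    return $html . '<!--' . $filler . '-->';
}
```

The message in show.php could then be passed through pad_error_body() before it is printed. The padding is invisible to visitors because it sits inside a comment.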
Note: A number of my updates appear on this same page. Sorry for not using Preview ...
--DotMG