. . .
Re-index log file - Server info offering: Server software, environment, MySQL, PDF-converter, image functions, php.ini file php integration, php security info. Each item holding lists of details. All text links, media links and thumbnails are active linked. As stated in chapter Introduction , this search engine uses some php libraries and
. . .
'Tips & Tricks & Mods' 3.1 All options It is possible to spider web pages from the command line, using the syntax: php spider.php <options> where <options> are: -all Reindex everything in the database. -eall Erase database and afterwards re-index all. -new Index all new URLs in database which had not jet been indexed. -erase Erase the
. . .
multiple strings). For example, for spidering and indexing http://www.domain.com/test.html to depth 2, use: php spider.php -u http://www.domain.com/test.html -d 2 If you want to reindex the same URL, use: php spider.php -u http://www.domain.com/test.html -r 3.2 Multithreaded indexing For command line operation parallel indexing has no
. . .
not jet been indexed <-new> Simply start several threads and add individual IDs to the option parameter like php spider.php -new1 php spider.php -new2 etc. The IDs will be added to the name of the corresponding log files like: db2_100524-21.47.56_ID1.html (log file of first thread) db2_100524-21.48.12_ID2.html (log file of second thread)
. . .
log file will be unreadable. 3.2.2 Re-index all To be invoked by once preparing the database with the command php spider.php <-preall> This will reset all 'Last indexed' tables to '0000', but will not erase the content of all the other tables. So the check whether the content of a page has changed (MD5sum) is still available for a fast