SiteIndex
The domain information is given one domain per line, in the format:

    domain <tab> path

For example:

    GetDave.com /user/home/httpd/html/GetDave/

The path is optional. If it isn't specified, then site_index will just provide a link to the domain.
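As an illustration of the line format above, here is a minimal Python sketch of how such a file could be read. It is not taken from SiteIndex itself: the function name `parse_domain_line` is hypothetical, and treating an empty path field the same as a missing one is an assumption.

```python
def parse_domain_line(line):
    """Split one 'domain<TAB>path' line into (domain, path).

    path is None when the line has no path field (assumption: an empty
    path field is treated the same as a missing one).
    """
    fields = line.rstrip("\n").split("\t")
    domain = fields[0].strip()
    path = fields[1].strip() if len(fields) > 1 and fields[1].strip() else None
    return domain, path


def parse_domain_file(text):
    """Parse every non-blank line of a domain list."""
    return [parse_domain_line(line) for line in text.splitlines() if line.strip()]
```

A domain that comes back with a path of `None` would only be linked to, as described above, rather than indexed from a local directory.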
Warning: It will overwrite files in "Site_Index/" in each root directory!
By default, it does a recursive listing of all files under the
root directory, showing all HTML files that it finds. It won't show
directories that don't contain HTML.
You can prune/avoid any directories in your tree by creating
one of the following files in that directory:
    .no_index       Won't include directory in site index
    .no_contents    Will include directory without any contents
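The effect of the two marker files on a recursive listing can be sketched roughly as follows. This is a Python illustration, not the script's own code. The function name `list_site` is hypothetical, and it assumes that a marker also stops descent into subdirectories and that "HTML files" means names ending in .html or .htm.

```python
import os


def list_site(root):
    """Return {directory: [HTML file names]} honouring the marker files."""
    listing = {}
    for dirpath, dirnames, filenames in os.walk(root):
        if ".no_index" in filenames:
            dirnames[:] = []          # assumption: the whole subtree is skipped
            continue
        if ".no_contents" in filenames:
            dirnames[:] = []          # assumption: nothing below it is listed
            listing[dirpath] = []     # directory appears, but with no contents
            continue
        html = sorted(f for f in filenames
                      if f.lower().endswith((".html", ".htm")))
        if html:                      # directories without HTML are left out
            listing[dirpath] = html
    return listing
```

Clearing `dirnames` in place is how `os.walk` is told not to descend any further from the current directory.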
You can also ignore parts of the tree using the klunky "-ignore <regex>" option. Some examples:
    -ignore '/images$'                 Ignore any "images" directories
    -ignore '/(images|thumbnails)$'    Multiple ignores
    -ignore '/\.'                      Ignore dot directories

At the same time, you can specify which files get indexed. Right now it just indexes html files. You can specify this regex with -index. Example:

    -index '\.(s?html?|txt)$'          Index .shtm, .shtml, .htm, .html, .txt
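To check what these example patterns match before running the indexer, they can be tried out in Python, whose `re` module handles these particular expressions the same way. This is only a sketch: the helper names are hypothetical, and it assumes the -ignore regex is tested against a directory path and the -index regex against a file name.

```python
import re

# Patterns copied from the examples above.
IGNORE_DIRS = re.compile(r"/(images|thumbnails)$")
IGNORE_DOT = re.compile(r"/\.")
INDEX_FILES = re.compile(r"\.(s?html?|txt)$")


def is_ignored(dir_path):
    """True if the directory path matches one of the ignore patterns."""
    return bool(IGNORE_DIRS.search(dir_path) or IGNORE_DOT.search(dir_path))


def is_indexed(file_name):
    """True if the file name matches the index pattern."""
    return INDEX_FILES.search(file_name) is not None


if __name__ == "__main__":
    for d in ["/site/images", "/site/images/old", "/site/.cache"]:
        print(d, is_ignored(d))
    for f in ["a.shtml", "a.htm", "notes.txt", "a.php"]:
        print(f, is_indexed(f))
```

Because '/images$' is anchored at the end of the path, it matches the "images" directory itself but not paths that continue beneath it.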
You can also specify an optional importance for each domain, using the format:
    domain <tab> path <tab> importance

Importance is a value from 1-5: