User documentation

Usage:

-h [ site name ]
Example of valid site name:
http://site.com
https://site.com
https://site.com:port
site.com

-D : enables debug mode

-T [report type] : Default is HTML
Report types:
1 : ASCII
2 : HTML

-U [site username] : set the username for Basic Authentication

-P [site password] : set the password for Basic Authentication

-X [proxy host] : set the proxy host to use

-C [proxy port] : set the proxy port to use. Default is 80.

Special keys
S : If you type the 's' key in the console where metis is running, it will stop the spider.

more to come ...

Metis configuration files
You can control the way the code functions with the following configure files located in the conf directory of your installation path.

webglobal.properties

Option Description Default Value
default_port Default port for HTTP connections
80
debug

0 : disable debug

1: enable debug

0
useragent The value will be used in the User-Agent request header for all HTTP/SSL request
Riding the Snake v1.3
proxy_host Proxy host to use
http_password Password used for each requests
http_user User used for each requests
timeout_retries Number of times to retry a request when a timeout occurs
0
socket_timeout Number of mili seconds before a request will timeout
10000
proxy_port Proxy port to use
80
max_request_download This set the maximum file size the engine will download. This protects again downloads of large files when the engine is thinking the file content is simple html. Setting this value to 0 will tell the engine to download any file size.
1572864
potential_html_file_ext This is a list of file extensions that the engine considers as containing html. This is used to bypass some web server protections software. This was developed to allow spidering a site protected by SecureIIS from Eeye.com

htm;html;shtml;stm;pl;cfm;

php;php3;mv;cgi;asp;css


threadmanager.properties

Option Description Default Value
max_threads This set the maximum number of request the engine will run in concurrence.
100