wwwchecklinks [ -imagelinks yes|no ] [ -checkalllinks yes|no ] rooturl1 ... rooturln [ -prune url1 ... urln ]
wwwchecklinks is a program that looks for broken links in web page hierarchies. The root of the hierarchy to be checked is determined by one or more URLs given on the command line. The result is displayed in an X window which allows you to browse the result (even while the search is in progress). The result can also be saved to two files: one summary file (called CheckLinks.Summary) and one complete cross reference listing for the checked documents (called CheckLinks.Report).
-imagelinks yes|no
yes means that links to inlined images are checked. This is the default.
-checkalllinks yes|no
yes means that all links are checked. no means that only links to documents on the same server as one of the root documents are checked. The default is no.
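The scope rule for -checkalllinks can be sketched in Python as follows. This is only an illustration, not the actual wwwchecklinks code: the names server_of and should_check are hypothetical, and reading "same server" as "same host name and port" (with well-known default ports filled in) is an assumption that the description above does not spell out.

```python
from urllib.parse import urlsplit

# Well-known default ports, used when a URL does not name a port explicitly.
DEFAULT_PORTS = {"http": 80, "https": 443, "ftp": 21, "gopher": 70}


def server_of(url: str) -> tuple:
    """Return (host, port) for a URL. Assumption: this pair identifies a server."""
    parts = urlsplit(url)
    port = parts.port or DEFAULT_PORTS.get(parts.scheme)
    return (parts.hostname, port)


def should_check(link: str, root_urls: list, check_all_links: bool = False) -> bool:
    """Decide whether a link is in scope.

    check_all_links=True mirrors "-checkalllinks yes": every link is checked.
    Otherwise only links on the same server as one of the roots are checked.
    """
    if check_all_links:
        return True
    root_servers = {server_of(root) for root in root_urls}
    return server_of(link) in root_servers
```

With the single root http://www.cs.chalmers.se/~hallgren/, this sketch would check http://www.cs.chalmers.se/Fudgets/ but leave http://www.chalmers.se/ unchecked unless check_all_links is set.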
-prune url1 ... urln
wwwchecklinks:
wwwchecklinks http://www.cs.chalmers.se/~hallgren/
(http://www.cs.chalmers.se/~hallgren/) and check them too.
wwwchecklinks -checkalllinks yes http://www.cs.chalmers.se/~hallgren/
wwwchecklinks http://www.cs.chalmers.se/~hallgren/ -prune http://www.cs.chalmers.se/~hallgren/naptv
The top part of the window shows a summary of the result, which is updated only when you press the Update button. You can press Update at any time to see how the search is progressing. You can also press the Save button at any time to save the information collected so far. (The files will be called CheckLinks.Summary and CheckLinks.Report.)
The bottom part of the window consists of three boxes showing the progress of the search. From top to bottom they show which document is being checked at the moment, the server connection status, and which link is being checked at the moment.
When the search is complete (and you have pressed the Update button) the window will look something like this:
The summary window shows one line for each URL encountered during the search. The lines have the following general format:
reference_count -> information URL
where reference_count is the number of references to this URL, information is some brief information about the URL or the document it refers to, and URL is the URL in question.
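A saved summary line in this format can be split back into its three fields with a few lines of Python. This is a minimal sketch, not part of wwwchecklinks: the names SummaryLine and parse_summary_line are hypothetical, and it assumes the URL is the last whitespace-separated token on the line (so it would misparse a URL that itself contains a space).

```python
from typing import NamedTuple


class SummaryLine(NamedTuple):
    reference_count: int
    information: str
    url: str


def parse_summary_line(line: str) -> SummaryLine:
    """Split 'reference_count -> information URL' into its fields."""
    count_text, arrow, rest = line.partition("->")
    if not arrow:
        raise ValueError("not a summary line: %r" % line)
    # Assumption: the URL is the final token; everything before it is
    # the free-form information field, which may contain spaces.
    information, _, url = rest.strip().rpartition(" ")
    return SummaryLine(int(count_text), information.strip(), url)
```

For a line such as "3 -> text/html 7 1 24 http://www.cs.chalmers.se/~hallgren/", the sketch yields a reference count of 3, the information "text/html 7 1 24" and the URL as the last field.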
The URLs encountered during the search are displayed in the following order:
BAD 404 Not Found. The web server replied that there is no document with the given URL.
BAD 301 Moved Permanently. The web server replied that the document has been moved. The most common reason for this is that you forgot to put a / at the end of a URL that refers to a directory. You usually don't notice this error in ordinary Web browsers, since they automatically reissue the request with the correct URL. This slows down access and increases the load on the server, though.
The broken links are ordered by the error number and the number of references to them.
Not checked.
Checked documents: the information is the type (for example text/html) of the document and the number of unchecked, broken and working links in the document. The documents are ordered by the number of broken links.
? ? ? (indicating that the number of working/broken links is not known).
The summary is saved in CheckLinks.Summary when you press the Save button.
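The BAD 301 case described above, where the redirect is caused only by a missing / at the end of a directory URL, can be told apart from a genuine move by comparing the requested URL with the URL the server redirects to. A minimal Python sketch follows; the function name is hypothetical, and it assumes the redirect target is available as an absolute URL, which may not hold for every server.

```python
def is_missing_slash_redirect(requested_url: str, redirect_target: str) -> bool:
    """Return True if the redirect only adds a trailing '/' to the requested URL.

    Assumption: redirect_target is the absolute URL the server pointed to.
    """
    return (not requested_url.endswith("/")
            and redirect_target == requested_url + "/")
```

A link to http://www.cs.chalmers.se/~hallgren that is redirected to http://www.cs.chalmers.se/~hallgren/ would be flagged by this check, and the fix is simply to add the slash to the link.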
Clicking on a line in the summary window opens a window containing more detailed information on that link/document. For example, clicking on the line
3 -> text/html 7 1 24 http://www.cs.chalmers.se/~hallgren/
(which by the way says that there are three references to my home page
among the documents checked and that my home page contains 7 unchecked links, one
broken link and 24 working links)
in the above window produces the following information:
Document http://www.cs.chalmers.se/~hallgren/
Type: text/html
References to this document from:
  http://www.cs.chalmers.se/~hallgren/lic-abstract.html
  http://www.cs.chalmers.se/~hallgren/videoband.html
  http://www.cs.chalmers.se/~hallgren/klockan.cgi
BAD links
  http://www.cs.chalmers.se/Fudgets/
Unchecked links
  http://lips.cs.chalmers.se:8888/trams
  gopher://sunic.sunet.se:43/0thomas-h.pp.se
  gopher://cs.chalmers.se:79/0/w hallgren
  http://slip-02.cs.chalmers.se/
  ftp://ftp.cs.chalmers.se/pub/users/hallgren
  http://www.chalmers.se/
Good links
  http://www.cs.chalmers.se/~hallgren/count.cgi
  http://www.cs.chalmers.se/~hallgren/klockan.cgi
  http://www.cs.chalmers.se/~hallgren/wget.cgi
  http://www.cs.chalmers.se/~hallgren/ibtelpre.html
  (+ the remaining 20 good links)
This information (for all documents) is saved in CheckLinks.Report when you press the Save button.
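The information field of a document line, such as "text/html 7 1 24" in the example line above, packs the document type and the unchecked, broken and working link counts into one string. The Python sketch below unpacks it. The names DocumentInfo and parse_document_info are hypothetical, and it is an assumption that lines with unknown counts keep the same layout with "? ? ?" in place of the three numbers.

```python
from typing import NamedTuple, Optional


class DocumentInfo(NamedTuple):
    content_type: str
    unchecked: Optional[int]  # None when the count is shown as "?"
    broken: Optional[int]
    working: Optional[int]


def parse_document_info(information: str) -> DocumentInfo:
    """Parse 'type unchecked broken working', e.g. 'text/html 7 1 24'."""
    content_type, *counts = information.split()
    if len(counts) != 3:
        raise ValueError("expected a type and three counts: %r" % information)
    # Order of the counts follows the text: unchecked, broken, working.
    unchecked, broken, working = (
        None if count == "?" else int(count) for count in counts
    )
    return DocumentInfo(content_type, unchecked, broken, working)
```

Applied to "text/html 7 1 24", this gives 7 unchecked links, 1 broken link and 24 working links, matching the reading of the example line given above.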
{cs,math,md,mdstud}.chalmers.se.