using wget for website broken link checking

lynx is broken for https website link checking. We use wget instead to check for broken internal links on your website.

wget --spider -r -nd -nv -w1 -o mysite.log

That command will tell you which internal website links are broken.

Near the bottom of mysite.log will be

Found no broken links

Assuming you use .html or .md files on your computer, you can search them for text with Notepad++ or findtext

findtext "*.html" 

wget spider options

don’t store HTML files retrieved
put output file in the current directory
non-verbose. Minimal messages output
wait 1 second between requests (don’t get banned by your own server for false scraping detection)



Leave a Comment