
Multithreaded Go-based web crawler with per-domain limits, queue-based scheduling, Bloom filter deduplication, robots.txt compliance, user-agent rotation, and disaster recovery. This project is under active development and a full README is coming soon. For now, clone the project and run main.go to use it.
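To illustrate the Bloom filter deduplication idea mentioned above, here is a minimal sketch using only the Go standard library. It is not taken from this repository: the names `BloomFilter`, `NewBloomFilter`, `TestAndAdd`, and `hashes`, the filter size, and the probe count are all hypothetical, chosen only for the example.

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// BloomFilter is a hypothetical fixed-size bit set probed at k positions per key.
// It is not safe for concurrent use; a multithreaded crawler would need to guard
// it with a mutex or similar.
type BloomFilter struct {
	bits []uint64
	m    uint64 // total number of bits (must be > 0)
	k    uint64 // number of probes per key
}

// NewBloomFilter allocates a filter with m bits and k probes per key.
func NewBloomFilter(m, k uint64) *BloomFilter {
	return &BloomFilter{bits: make([]uint64, (m+63)/64), m: m, k: k}
}

// hashes derives two base hash values from FNV-1a for double hashing.
func hashes(key string) (uint64, uint64) {
	h := fnv.New64a()
	h.Write([]byte(key))
	h1 := h.Sum64()
	h.Write([]byte{0xff}) // extend the input to obtain a second value
	h2 := h.Sum64() | 1   // force odd so the probe step is never zero
	return h1, h2
}

// TestAndAdd reports whether key was possibly seen before, then records it.
// A false result is definite; a true result may be a false positive.
func (b *BloomFilter) TestAndAdd(key string) bool {
	h1, h2 := hashes(key)
	seen := true
	for i := uint64(0); i < b.k; i++ {
		pos := (h1 + i*h2) % b.m
		word, mask := pos/64, uint64(1)<<(pos%64)
		if b.bits[word]&mask == 0 {
			seen = false
			b.bits[word] |= mask
		}
	}
	return seen
}

func main() {
	seen := NewBloomFilter(1<<20, 4) // assumed sizes, for illustration only
	urls := []string{"https://example.com/a", "https://example.com/b", "https://example.com/a"}
	for _, u := range urls {
		if seen.TestAndAdd(u) {
			fmt.Println("skip (probably seen):", u)
		} else {
			fmt.Println("enqueue:", u)
		}
	}
}
```

The trade-off in this kind of structure is memory against accuracy: a URL reported as unseen is definitely new, while a URL reported as seen may occasionally be a false positive, so a crawler built this way can skip a small fraction of pages it has never visited.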


daniel-maxwell/WebCrawler
