java - Are there any open-source implementations of the Mercator Web Crawler?
Marc Najork and Allan Heydon have written an excellent paper on a scalable, extensible web crawler written in Java called Mercator.
Here are some resources on the Mercator web crawler:
- Mercator presentation (PDF)
- Mercator introduction (PDF)
- Mercator web crawler paper (PDF)
(First result for the Google query "web crawling contents najork pdf".)
Has anyone seen any implementations of this crawler (preferably in Java)?
Update:
I was having trouble with the links, so I went looking for better links to the referenced papers. I think I've fixed them now.
I've found a couple of Java crawlers that are supposed to be pretty close to Mercator:
Other references are welcome.