PHP web-crawler -


i'm looking php web-crawler gather links large site , tell me if links broken.

so far i've tried modifying example on here myself. my question codei've tried grabbing phpdig site down. suggestions great on how should proceed great.

edit

the problem isn't grabbing of links issue of scale i'm not sure if script modified sufficient enough grab possibly thousands of url's tried setting depth search link 4 , crawler timed out through browser. else mentioned killing processes not overload server, please elaborate on issue.

not ready-to-use solution, simple html dom parser 1 of favourite dom parsers. let's use css selectors finding nodes on document, can find <a href="">'s. these hyperlinks's can build own crawler , check if pages still available.

you can find here.


Comments

Popular posts from this blog

php - What is the difference between $_SERVER['PATH_INFO'] and $_SERVER['ORIG_PATH_INFO']? -

fortran - Function return type mismatch -

queue - mq_receive: message too long -