if our network has proxy , sites can not opened. want check iteratively , how many sites can accessed through our network.
find out source code of proxy block page is.
use urllib , beautifulsoup try , scrape page , parse page's source code see if can find unique can tell if site accessible or not.
for example, in office, when page blocked our proxy title tag of source code <title>network error</title>. such identifier you.
just quick idea.
so example have url's test in list , iterate through list in loop , try , scrape each site.
Comments
Post a Comment