If our network has a proxy, some sites cannot be opened. The goal is to check iteratively how many sites can be accessed through our network.
First, find out what the source code of the proxy's block page looks like.
You can use urllib and BeautifulSoup to try to scrape each page, then parse the page's source code to see whether you can find something unique that tells you if the site is accessible or not.
For example, in my office, when a page is blocked by our proxy, the title tag of the source code is <title>network error</title>. Something like that could serve as the identifier for you.
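As a rough sketch of that title check, assuming the block page's title really is "network error" (yours will probably differ): the snippet below uses the standard library's html.parser in place of BeautifulSoup so it runs without extra installs. The names BLOCK_TITLE, TitleParser and is_blocked are made up for illustration.

```python
from html.parser import HTMLParser

# Assumption: the title your proxy's block page uses. Adjust to your network.
BLOCK_TITLE = "network error"


class TitleParser(HTMLParser):
    """Collect the text inside the first <title> element of a page."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self._done = False
        self.title = ""

    def handle_starttag(self, tag, attrs):
        # Only the first <title> is of interest.
        if tag == "title" and not self._done:
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self._done = True

    def handle_data(self, data):
        # Text may arrive in several chunks, so append rather than assign.
        if self._in_title:
            self.title += data


def is_blocked(html, block_title=BLOCK_TITLE):
    """Return True if the page title matches the proxy block page title."""
    parser = TitleParser()
    parser.feed(html)
    parser.close()
    return parser.title.strip().lower() == block_title.lower()
```

With BeautifulSoup installed, the same comparison could be done on soup.title instead of the hand-written parser.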
This is just a quick idea.
So, for example, keep the URLs you want to test in a list, iterate through the list in a loop, and try to scrape each site.
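The loop could look something like the sketch below. It uses urllib.request from the standard library and a plain substring check for the block page marker instead of a full parse. The marker string, the helper names fetch and find_accessible, and the choice to count a failed request as "not accessible" are all assumptions to adapt.

```python
import urllib.error
import urllib.request

# Assumption: a string that appears only in your proxy's block page.
BLOCK_MARKER = "<title>network error</title>"


def fetch(url, timeout=10):
    """Return the page source as text, or None if the request fails."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.read().decode("utf-8", errors="replace")
    except (urllib.error.URLError, OSError, ValueError):
        # Covers HTTP errors, connection problems and malformed URLs.
        return None


def find_accessible(urls, fetch_page=fetch, marker=BLOCK_MARKER):
    """Return the URLs whose page loaded and does not contain the marker."""
    accessible = []
    for url in urls:
        html = fetch_page(url)
        if html is not None and marker not in html.lower():
            accessible.append(url)
    return accessible
```

Calling find_accessible(["https://example.com", "https://example.org"]) returns the reachable subset, and len() of the result gives the count. The fetch_page parameter is there only so the loop can be tried with canned pages instead of real requests.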