python3.8 use urlib's request method and BeautifulSoup's formatting source code and search method to crawl multiple static general website images
preface
This paper mainly introduces the method of python crawling static web page pictures. It can crawl multiple web pages and is universal. At present, the method of dynamic web page image crawling has not been realized. Differences between static web pages and dynamic web pages: Static web page image: the image url address is directl ...
Posted by gamerzfuse on Wed, 02 Feb 2022 17:44:38 +0100
python third-party library beautiful soup4 document learning
output
Format the output, format the beautiful soup document tree with the prettify() method, and then output it in Unicode encoding. Each XML/HTML tag occupies a separate line
markup = '<a href="http://example.com/">I linked to <i>example.com</i></a>'
soup = BeautifulSoup(markup,"html.parser")
print(soup.prettify()) ...
Posted by yorktown on Sun, 16 Jan 2022 21:12:28 +0100