GitHub - MyRespect/BlogCrawler: Security-related Blog Crawler

Use the URL prefix of the website as the spider name, table name, and web name.
Modify pipelines.py to create a table for the corresponding blog website.
Modify process_response in middlewares.py to handle dynamically loaded websites.
Write your own crawler in the spider folder.
Use an XPath helper extension in your browser to locate page elements quickly.
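
The first two steps can be sketched as follows. This is a minimal illustration, not the repository's actual code: it assumes "URL prefix" means the first host label after dropping "www", and it assumes SQLite as the storage backend (the README does not say which database is used). The names name_from_url and BlogTablePipeline, and the url/title/content columns, are hypothetical. The pipeline class itself only relies on the standard Scrapy hooks open_spider, process_item, and close_spider.

```python
import re
import sqlite3
from urllib.parse import urlparse


def name_from_url(url):
    """Derive one shared name (spider, table, web) from a blog URL.

    Assumption: the "URL prefix" is the first host label after "www".
    """
    host = urlparse(url).hostname or ""
    labels = [label for label in host.split(".") if label and label != "www"]
    prefix = labels[0] if labels else "blog"
    # Replace anything that is not a letter, digit, or underscore.
    return re.sub(r"\W", "_", prefix.lower())


class BlogTablePipeline:
    """Hypothetical item pipeline: one table per blog, named after the spider."""

    def __init__(self, db_path=":memory:"):
        # SQLite is an assumption; swap in the database the project really uses.
        self.db_path = db_path
        self.conn = None
        self.table = None

    def open_spider(self, spider):
        # Called by Scrapy when the spider starts: create the table if needed.
        self.conn = sqlite3.connect(self.db_path)
        self.table = re.sub(r"\W", "_", spider.name)
        self.conn.execute(
            f'CREATE TABLE IF NOT EXISTS "{self.table}" '
            "(url TEXT PRIMARY KEY, title TEXT, content TEXT)"
        )

    def process_item(self, item, spider):
        # Store one scraped post; re-crawled URLs overwrite the old row.
        self.conn.execute(
            f'INSERT OR REPLACE INTO "{self.table}" '
            "(url, title, content) VALUES (?, ?, ?)",
            (item.get("url"), item.get("title"), item.get("content")),
        )
        self.conn.commit()
        return item

    def close_spider(self, spider):
        # Called by Scrapy when the spider finishes.
        self.conn.close()
```

To try it inside a Scrapy project, a class like this would be listed under ITEM_PIPELINES in settings.py, and each spider's name attribute would be set to the value name_from_url returns for its blog, so the spider name and table name stay in sync.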

cti_crawler
all.log
main.py
readme.md
requirements.txt
scrapy.cfg
