Skip to content

Scrapeur is a Python script for scraping emails, by parsing web pages.

License

Notifications You must be signed in to change notification settings

gregorysimoes/scrapeur

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

scrapeur.py

Scrapeur is a Python script for scraping emails, by parsing web pages.

Released under GNU General Public Licence

Requirements

  • sys
  • argparse
  • re
  • urllib2
  • csv

Instructions

Run scrapeur.py with arguments

python scrapeur.py -h

arguments:
  -h, --help     show this help message and exit
  -i INPUT_PATH  path to the csv file with urls
  -o OUT         out file for saving data

To do

  • scrap a single url by passing it in
  • add a user-agent
  • scrap website with login authentification
  • scrap specific content (XPath or CSSSelectors)

About

Scrapeur is a Python script for scraping emails, by parsing web pages.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages