GitHub

ReadFile.py: used to list the directory contents without using os.scandir, os.walk, glob.

SearchInsert.py : The list can grow to a maximum length of 10. Make this limit as a configurable at

        the instantiation part of the class.

        ○ The insert function should get a string as its argument, insert the string to

        the list and return the index in which it was inserted. If the length of the list

        reaches maximum length, the oldest item accessed with select should be

        deleted and the new item should be inserted in its place.

        ○ The select function should accept an integer as argument and return the

        value at that index in the list.

Scrape.py: Environment Setup for Scrapy to scrape dynamic website

  1) Installation for python3
    > sudo apt-get install python3.5

  2) Installation for pip
    > sudo apt-get install python3-pip

  3) Installaton for scrapy
    > sudo pip3 install scrapy

  4) Installation for mongodb:
    > sudo apt-get install mongodb

  5) Installation for pymongo
    > sudo pip3 install pymongo

  6) Installation for BeautifulSoup
    > sudo pip3 install bs4


  Deploy 	project Scraper:

    1) scrapy startproject scrapnykaa
    2) The folder constitue of files and folders
    3) Inside the main project there is directory for spider
      In scrapynykaa/scrapynykaa/spiders create file name called scrapper.py
    4) Copy the contents from file name scraper.py and paste it in the file scrapper.py. 
    5) Run command: scrapy crawl scrapper 
    6) It will parse all data with
      product-id, image-link, and product-title and store it in monogdb.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
README.md		README.md
ReadDirectory.py		ReadDirectory.py
SearchInsert.py		SearchInsert.py
scraper.py		scraper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Languages

utKarshOO9/pystuff

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages