Skip to content

boxxello/Proxy-Scraper-pkg

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

42 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Proxy Scraper

Proxy Scraper is an open source tool that finds public proxies from multiple sources and asynchronously checks them.

image

pip install -U git+https://github.com/boxxello/Proxy-Scraper-pkg.git

Features

  • Support protocols: HTTP(S)
  • Automatically removes duplicate proxies
  • Validates the proxies that are inputted from an input file

Requirements

Python 3.8+

All the requirements will be installed with the setup.py - or you can manually installed them by running.

pip install -r requirements.txt

Proxy-Scraper-pkg Hello World

from proxy_scraper.getproxy import GetProxy

proxy_scraper = GetProxy()
proxy_scraper.init()
proxy_scraper.load_input_proxies()
proxy_scraper.validate_input_proxies()
proxy_scraper.load_plugins()
proxy_scraper.grab_web_proxies()
proxy_scraper.validate_web_proxies()
proxy_scraper.save_proxies()

either way you can also run it from a console.
Usage:

pyhton -m proxy_scraper [--input name_of_the_file.txt] [--output name_of_the_file2.txt]
                        [--debug]

Experiencing issues?

Make sure that You using latest version!!!

pip install -U git+https://github.com/boxxello/Proxy-Scraper-pkg.git

TODO

  • Save the current ips in a db.
  • Make an API to retrieve the latest scraped ips without having you to run it on your machine.
  • Turn every plugin into an instance of a child which inherits its properties from a father.

Contributing

License

Licensed under the Apache License, Version 2.0

Thanks goes to these existing libraries

  • This product include portions of code from GetProxy.
  • This product include portions of code from ProxyBroker2.

** Disclaimer & WARNINGS:

  1. Use this ONLY for Educational Purposes! By using this code you agree that I'm not responsible for any kind of trouble caused by the code.
  2. Make sure web-scraping is legal in your region.

About

A simple but effective proxy scraper and validator

Topics

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages