Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Daily Scraper | 88 | 21 hours ago | 16 | agpl-3.0 | HTML | |||||
Fetches information about every webpage 🤖 | ||||||||||
Webpage Rs | 31 | 6 | 6 days ago | 11 | January 06, 2023 | 2 | Rust | |||
Small Rust library to fetch info about a web page: title, description, language, HTTP info, RSS feeds, Opengraph, Schema.org, and more | ||||||||||
Awesome Seo Scripts | 22 | 3 years ago | JavaScript | |||||||
Random SEO scripts | ||||||||||
Metatags | 19 | 5 years ago | PHP | |||||||
A Laravel package to fetch webpage metadata ( Open Graph | Twitter | Facebook | Article ) | ||||||||||
Webpage Scraper | 15 | 7 years ago | 1 | Python | ||||||
This is a flask based application which fetches images, hyperlinks, indented source code and text after stripping the html tags from a given webpage and allows you to save them onto your system in a directory or text file with a name of your choice. | ||||||||||
Web2db | 13 | 3 years ago | 6 | September 22, 2020 | Python | |||||
Fetch webpage full-text, persist link and full text to SQLITE3 db, resumable with tqdm progressbar. | ||||||||||
Gangsta | 8 | 6 years ago | apache-2.0 | PHP | ||||||
Fetch OpenGraph data from a url and display in ExpressionEngine templates | ||||||||||
Metadog | 7 | 2 | 7 years ago | 8 | November 02, 2016 | mit | JavaScript | |||
Sniffs out and fetches open graph and schema.org metadata from webpages. | ||||||||||
Webpage | 6 | 5 days ago | 13 | HTML | ||||||
Sources for the JMLR webpage | ||||||||||
Xst Google Calendar Events | 4 | 2 years ago | 3 | mit | JavaScript | |||||
A react Component, which fetch Calendar entries from Google Calender and output them to your Webpage. It's based on Javascript (react) and create an sortable Event-List Table. No need for PHP or Database-Connection. |
webpage-scraper
is a flask based application which allows the users to :
To install requirements:
[sudo] pip install requirements
If you don't have pip installed, this Python installation guide can guide you through the process.
To install MongoDB Community Edition:
on OSX, refer to: https://docs.mongodb.com/manual/tutorial/install-mongodb-on-os-x/
on Ubuntu, refer to: https://docs.mongodb.com/manual/tutorial/install-mongodb-on-ubuntu/
on Windows, refer to: https://docs.mongodb.com/manual/tutorial/install-mongodb-on-windows/
Make sure you have MongoDB installed
git clone http://github.com/mansimarkaur/webpage-scraper
cd webpage-scraper
python crawler_flask.py
Open http://127.0.0.1:5000/ in your browser. Input URL and have fun 👍