Web2db

Fetch webpage full-text, persist link and full text to SQLITE3 db, resumable with tqdm progressbar.
Alternatives To Web2db
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Web2db13
3 years ago6September 22, 2020Python
Fetch webpage full-text, persist link and full text to SQLITE3 db, resumable with tqdm progressbar.
Firelinks10
11 years agoRuby
Sync elinks with Firefox
Web Scraping Box Office Mojo6
4 years agoPython
for each year scraped the table and collected all tables and made sqlite database.
Reask4
5 years agomitJavaScript
Reask is a project developed with React&Flask. [ Reask 是用 React&Flask 开发的全栈项目 ] :rocket:
Sqlite_bookstore2
7 years agoPython
An example SQLite implementation of an Online Bookstore (well... Marketplace) for my Database Systems class.
Alternatives To Web2db
Select To Compare


Alternative Project Comparisons
Readme

web2db

Fetches the full text of input URLs and persists them to sqlite3 DB file.
Fetching is resumable and comes with a progressbar.

Install:

pip install web2db

Quickstart:

import web2db  
web2db.dump('data.db', urls=[
    'https://www.google.com',
    'https://www.yahoo.com',
    'https://www.msn.com'
])

Query the DB file:

df = web2db.to_df(sqlite3_file_path)
print(df.shape)
print(df)

SQL Schema:

  • Table:
    • WebPages

      url fulltext status_code
      text text int

Features:

  • Resumable webpage fetching
  • Saves to local SQLITE3 DB
  • tqdm progress bar
Popular Sqlite Projects
Popular Webpage Projects
Popular Data Storage Categories

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Sqlite3
Fetch
Webpage
Progress Bar