Awesome Open Source
Search results for python wikipedia
439 search results found
Wikiextractor
⭐
3,440
A tool for extracting plain text from Wikipedia dumps
Thealgorithms Python
⭐
2,140
TheAlgorithms/Python
Langdetect
⭐
1,571
Port of Google's language-detection library to Python.
Word2vec Api
⭐
1,303
Simple web service providing a word embedding model
Pywhatkit
⭐
1,193
Send WhatsApp messages at a certain time, and many other things.
Blink
⭐
1,047
Entity Linker solution
Wikipedia2vec
⭐
899
A tool for learning vector representations of words and entities from Wikipedia
Mwparserfromhell
⭐
676
A Python parser for MediaWiki wikicode
Wikiteam
⭐
661
Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to tiniest wikis. As of 2023, WikiTeam has preserved more than 350,000 wikis.
Natural Questions
⭐
655
Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question answering systems.
Wordninja
⭐
648
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
Pywikibot
⭐
603
A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See https://www.mediawiki.org/wiki/Developer_account for contributing.
Genre
⭐
586
Autoregressive Entity Retrieval
Wik
⭐
564
wik is used to get information about anything from the shell using Wikipedia.
Gensim Data
⭐
492
Data repository for pretrained NLP models and NLP corpora.
Wptools
⭐
448
Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis
Wikipedia Api
⭐
426
Python wrapper for Wikipedia
Fact Extractor
⭐
413
Fact Extraction from Wikipedia Text
Hncynic
⭐
332
Generate Hacker News Comments from Titles
Learning_to_retrieve_reasoning_paths
⭐
322
The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
Chakin
⭐
313
Simple downloader for pre-trained word vectors
Worddumb
⭐
305
A calibre plugin that generates Kindle Word Wise and X-Ray files for KFX, AZW3, MOBI and EPUB eBooks. Supports 24 languages.
Gpt2 Japanese
⭐
305
Japanese GPT2 Generation Model
Adam_qas
⭐
298
ADAM - A Question Answering System. Inspired by IBM Watson
Mwclient
⭐
287
Python client library to interface with the MediaWiki API
Rel
⭐
279
REL: Radboud Entity Linker
Wikitables
⭐
279
Import tables from any Wikipedia article as a dataset in Python
Airport Codes
⭐
274
List of Airport codes, locations and other information around the world
Warren
⭐
272
Links to lose yourself in, curated from HN and other sources
Mysqldump To Csv
⭐
264
A quickly-hacked-together Python script to turn mysqldump files to CSV files. Optimized for Wikipedia database dumps.
Wikipedia Extractor
⭐
247
This is a mirror of the script by Giuseppe Attardi, and contains history before the official repo started: https://github.com/attardi/wikiextractor --- Extracts and cleans text from a Wikipedia database dump and stores the output in a number of files of similar size in a given directory.
Wikiplots
⭐
234
A dataset containing story plots from Wikipedia (books, movies, etc.) and the code for the extractor.
Spikex
⭐
220
SpikeX - SpaCy Pipes for Knowledge Extraction
Alfred Searchio
⭐
215
Alfred workflow to auto-suggest search results from multiple search engines and languages.
Ai Personal Voice Assistant Using Python
⭐
210
Plaintextwikipedia
⭐
183
Convert Wikipedia database dumps into plaintext files
Isbntools
⭐
176
Python app/framework for 'all things ISBN' including metadata, descriptions, covers...
Fasttextjapanesetutorial
⭐
174
Tutorial to train fastText with Japanese corpus
Tg Search Bot
⭐
161
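A Telegram bot for searching magnet links for all kinds of videos. Supports favorites, exporting records, automatically saving magnet links, and similar operations; can be manually configured to block NSFW content and to connect through a proxy.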
Qb
⭐
160
QANTA Quiz Bowl AI
Mediawiki
⭐
160
MediaWiki API wrapper in Python http://pymediawiki.readthedocs.io/en/latest/
Gap Coreference
⭐
159
GAP is a gender-balanced dataset containing 8,908 coreference-labeled pairs of (ambiguous pronoun, antecedent name), sampled from Wikipedia for the evaluation of coreference resolution in practical applications.
Wikipedia Word Frequency
⭐
156
Gather modern English word frequencies from all enwiki articles.
Crawlerproject
⭐
147
Crawler projects: Lianjia (plain/scrapy), Hupu, Wikipedia, Baidu Maps API, Fang.com (distributed crawler), WeChat official…
The Algorithms
⭐
140
Algorithms repository
Tmnt_wikipedia_bot
⭐
138
Find Wikipedia titles that can be sung to the Teenage Mutant Ninja Turtles theme song.
Mediawiker
⭐
131
A plugin for the Sublime Text editor that lets you use it as a wiki editor on MediaWiki-based sites like Wikipedia and many others.
Question Answering Albert Electra
⭐
130
Question Answering using Albert and Electra
Wikientvec
⭐
129
Distributed representations of words and named entities trained on Wikipedia.
Levitation
⭐
127
Tools to convert Wikipedia dumps into Git repositories.
Truecaser
⭐
120
Language independent truecaser in Python.
Music Emotion Recognition
⭐
117
A Machine Learning Approach of Emotional Model
Wikipedia Question Generator
⭐
115
Uses NLP and Wikipedia to try to generate trivia questions
Quantulum3
⭐
112
Library for unit extraction - fork of quantulum for python3
Wikimapper
⭐
110
Mapping Wikipedia pages to Wikidata IDs and vice versa.
Naacl2018 Fever
⭐
110
Fact Extraction and VERification baseline published in NAACL2018
Causeofwhy
⭐
100
The goal of this project is to implement a Question Answering (QA) system that answers causal type questions. We use Wikipedia as a knowledge base, extracting answers to user questions from the articles.
Osm Wikidata
⭐
99
Match OSM entities with Wikidata items
Wpcorpus
⭐
98
wpcorpus - NLP corpus based on Wikipedia's full article dump
Mwlib
⭐
98
MediaWiki parser library
Qwikidata
⭐
95
Python tools for interacting with Wikidata
Citationhunt
⭐
93
A fun tool for quickly browsing unsourced snippets on Wikipedia.
Quantulum
⭐
92
Python library for information extraction of quantities from unstructured text
Doc2vec Api
⭐
92
document embedding and machine learning script for beginners
Autowikibot Py
⭐
90
Reddit bot that replies to comments with an excerpt from the linked Wikipedia article or section.
Dict2vec
⭐
88
Dict2vec is a framework to learn word embeddings using lexical dictionaries.
Annotated Wikiextractor
⭐
88
Simple Wikipedia plain text extractor with article link annotations and Hadoop support.
Ambigqa
⭐
86
An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"
Ja.text8
⭐
74
Japanese text8 corpus for word embedding.
Text Segmentation
⭐
73
Implementation of the paper: Text Segmentation as a Supervised Learning Task
Wikimon
⭐
72
A WebSocket-oriented monitor for Wikipedia (also, wikimon, wikital monsters)
Wikitextprocessor
⭐
69
Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
Ai_personal_digital_assistant
⭐
67
AI Personal Voice Assistant Project (Male - Female version)
Toolbox
⭐
67
A collection of tools, APIs and other resources to use in creative coding web projects.
Wiki Word2vec
⭐
66
Train a gensim word2vec model on Wikipedia.
Wikicurses
⭐
64
A simple curses interface for MediaWiki sites such as Wikipedia.
Ntee
⭐
64
Neural Text-Entity Encoder (NTEE)
Twlight
⭐
64
Library Card Platform for The Wikipedia Library
Tagme Python
⭐
62
Official TagMe API wrapper for Python.
Wikiedits
⭐
61
Automatic extraction of edited sentences from text edition histories.
Vor Knowledge Graph
⭐
59
🎓 Open knowledge mining and graph builder
Wiki Table Scrape
⭐
59
Scrape tables from Wikipedia articles into CSVs
Wikipedia Crawler
⭐
57
A program to crawl the entire Wikipedia and extract and store information from the pages as required.
Japanese Words To Vectors
⭐
57
Word2vec (word to vectors) approach for Japanese language using Gensim and Mecab.
Semanticizer
⭐
57
Entity Linking for the masses
Wikipedia_ner
⭐
56
📖 Labeled examples from wiki dumps in Python
Wiki Sim Search
⭐
54
Similarity search on Wikipedia using gensim in Python.
Tech Seo Crawler
⭐
54
Build a small, 3 domain internet using Github pages and Wikipedia and construct a crawler to crawl, render, and index.
Mcc_mnc
⭐
53
Provides accurate JSON and Python dicts of the publicly available information about MNOs
Convec
⭐
53
In this project, we use the skip-gram model to embed Wikipedia Concepts and Entities. The English version of Wikipedia contains more than five million pages, which suggests its capability to cover many English Entities, Phrases, and Concepts. Each Wikipedia page is considered as a concept.
Chainer Slack Twitter Dialogue
⭐
52
Chainer-Slack-Twitter-Dialogue
Hottosns W2v
⭐
51
hottoSNS-w2v: word embedding model trained on a large-scale Japanese SNS + Web corpus
Termtype
⭐
50
A Curses-based Python application to practice touch-typing by typing out random Wikipedia articles
Wiki Auto
⭐
50
Neural CRF Model for Sentence Alignment in Text Simplification
5ch Analysis
⭐
48
Scrapes 5ch's past logs to track words that were popular in the past (e.g., 香具師, orz)
Danker
⭐
48
Compute PageRank on >3 billion Wikipedia links on off-the-shelf hardware.
Crocodile
⭐
47
cRocoDiLe is a dataset extraction tool for Relation Extraction using Wikipedia and Wikidata presented in REBEL (EMNLP 2021).
Articlequality
⭐
46
A library for performing automatic detection of assessment classes of Wikipedia article text.
Codex
⭐
46
CoDEx: A set of knowledge graph Completion Datasets Extracted from Wikidata and Wikipedia
Jawiki Kana Kanji Dict
⭐
44
Generate SKK/MeCab dictionary from Wikipedia (Japanese edition)
1-100 of 439 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.