Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for html corpus
corpus
x
html
x
62 search results found
Fuzzdata
⭐
486
Fuzzing resources for feeding various fuzzers with input. 🔧
Pyate
⭐
242
PYthon Automated Term Extraction
Pre Modern_chinese_corpus_dataset
⭐
132
近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人文 计算语言
Tutorialbank
⭐
85
Tom
⭐
78
A library for topic modeling and browsing
Wordfreq
⭐
72
Text corpus calculation in Javascript. Supports Chinese, English.
Quanteda.dictionaries
⭐
64
Dictionaries for text analysis
Opencitations
⭐
62
OpenCitations provides in RDF accurate citation information harvested from the scholarly literature.
Biotoolsregistry
⭐
56
biotoolsregistry : discovery portal for bioinformatics
Tamil Nlp Catalog
⭐
51
Awesome List of Tamil NLP & AI Resources
Chinese Mandarin Dictionaries
⭐
49
中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.
Spect
⭐
48
SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/
Polminer
⭐
45
R-package for text mining with the Corpus Workbench (CWB) as backend
Dialogueact Tagger
⭐
42
A resource to create a multi domain Dialog Act Tagger for conversational agents using publicly available data
Cryptics
⭐
27
Cryptic crossword solver
Onthebooks
⭐
22
Files for the On The Books project
Clamav Fuzz Corpus
⭐
22
Seed Corpus for clamav-devel oss-fuzz integration.
Corpus Tools
⭐
22
Various functions to make bag-of-words approaches to text analysis more user-friendly
Quanteda.classifiers
⭐
21
quanteda textmodel extensions for classifying documents
Learningr
⭐
20
Helpful resources for learning R
Chinesenotes.com
⭐
19
Chinese Notes: A digital library for classical and historic Chinese texts with built in dictionary and reader
Saffron
⭐
19
Saffron 3 - Text Analysis and Insight Tool
Next Word Prediction
⭐
19
Application which predicts next word based on what was entered. Uses twits, blogs and newspapers corpora as a training set.
Nsmc_study
⭐
18
Naver sentiment movie corpus classification
Pelic Dataset
⭐
18
The University of Pittsburgh English Language Institute Corpus (PELIC) dataset
Deep Learning Nlp Sais
⭐
16
Roman18
⭐
16
Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)
Finer Data
⭐
15
Discursos De Navidad
⭐
14
A corpus of the Christmas speeches delivered by the head of state of Spain from 1937 to 2019
Lknlp.github.io
⭐
13
Dahnproject
⭐
13
Project DAHN "Digital Edition of historical manuscripts (correspondences)"
Hanzifreq
⭐
13
Chinese Character Frequencies
Productclassification
⭐
13
A playground for classifying products based on image and text features using deep learning.
Ynacc
⭐
13
The Yahoo News Annotated Comments Corpus (YNACC)
Textsummarizer
⭐
11
A text summarization tool for Marathi implemented as a project for course Adavanced NLP (CSCI 544)
Relna
⭐
11
Biomedical Relation Extraction for Transcription Factor and Gene / Gene Products (part of a Master Thesis at Rostlab, TUM)
Case2vec
⭐
10
A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated Council of Law Reporting for England & Wales (https://www.iclr.co.uk).
Nlp_demo
⭐
10
NLP Common Analytical Ideas
Ubiqu Ity
⭐
9
Tool for tagging texts in a corpus using the Docuscope Dictionary. Provides metadata in the form of cumulative csvs, individual token files, and rule csvs.
Sentimentator
⭐
8
Tool for sentiment analysis annotation
Spark_workshop
⭐
8
An introductory workshop to using Spark in data analysis
Muc3
⭐
8
Message Understanding Conference 3 Corpus
Chinese_sf
⭐
7
Textmining Bootcamp
⭐
7
a set of scripts, narrative texts, and data forming the structure for a text mining workshop
Middlemarch Critical Histories
⭐
7
Computational analyses of the critical history of George Eliot's Middlemarch
Justext Java
⭐
7
News Review Pickup
⭐
6
新闻人物言论自动提取---->得到说话的人和说话的内容
Unitednations
⭐
6
United Nations General Debate corpus
Pythainlp Corpus
⭐
6
pythainlp-data
Pata.physics.wtf
⭐
6
Pataphysics; what the fuck?
Perseusnlptoolkit
⭐
6
A bunch of modules that use/extend CLTK in order to work with Greek and Latin corpora maintained by the Perseus DL
Capstone
⭐
6
Iust Htmlchardet
⭐
6
A java tool for detecting charset encoding of HTML web pages
Germaparl
⭐
6
GermaParl R Data Package
Minimal Pairs
⭐
5
Tool for finding minimal pairs given a corpus of words
Sisyphe
⭐
5
Memorize poetry by hiding words progressively
Opencourt
⭐
5
This is a repo for a python scraper to build a corpus of U.S. Supreme Court Cases and then build a network graph from the citations. While not scalable for the full corpus, the repo also includes some server-side D3.js rendering for the resulting graph json.
Instagramfashionblogger
⭐
5
An Instagram fashion blogger recommendation app based on deep learning and natural language processing.
Emtk
⭐
5
Chbs
⭐
5
An implementation of http://xkcd.com/936/
Hackingsymptoms
⭐
5
Docs and Code repository for Hackathon 2020: Diagnosing rare diseases with AI - Hacking symptoms
Speech Recognition
⭐
5
Automatic Speech Recognition on the Digital Archive of the Southern Speech
Related Searches
Javascript Html (53,392)
Html Css (19,526)
Python Html (11,009)
Html Bootstrap (5,651)
Php Html (5,615)
Html Theme (5,550)
Html Jekyll (5,387)
Html Jquery (5,205)
Html Markdown (5,082)
Html Reactjs (4,782)
1-62 of 62 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.