Awesome Open Source
Search results for ruby corpus
26 search results found
Bayes_motel
⭐
188
Multi-variate Bayesian classification engine
Autosuggest
⭐
165
Autocomplete suggestions based on what your users search
Wp2txt
⭐
160
A command-line toolkit to extract text content and category data from Wikipedia dump files
Ankusa
⭐
101
Text classifier in Ruby that uses Hadoop/HBase, Mongo, or Cassandra for storage. New location for http://github.com/livingsocial/ankusa
Kantan Ej Dictionary
⭐
44
English-Japanese dictionary
Ruby Nlp
⭐
33
Various NLP tools for Ruby
Epitome
⭐
32
A LexRank implementation in Ruby
Random Word
⭐
26
Hyperdictionary
⭐
23
A tool for compiling, showing, cross-linking, and otherwise manipulating dictionaries, corpora, linguistic phylogenies, etc.
Corpus Processor
⭐
20
Handle linguistic corpora and convert them for use with NLP tools
Ruby Tf Idf
⭐
18
Ruby gem that calculates TF-IDF over a text to find the most relevant words in each document of the corpus
Correct Horse Battery Staple
⭐
18
xkcd-style password generation
Glossa
⭐
16
Ruby on Rails application that uses the Rails version of the Glossa system for corpus search and results management (https://github.com/textlab/rglossa). Includes a Dockerfile for constructing a Docker image containing the application (see https://docker.com).
Maxixe
⭐
16
A small statistical segmenter for any language.
Dejunk
⭐
13
Detect keyboard mashing and other junk in your data
Rdf Inference
⭐
12
RDF-Inference is a Ruby library for inferencing over a corpus of triples with RDFS and OWL properties.
Nytimes Solr Indexer
⭐
10
Solr Indexer for the New York Times Annotated Corpus
Madlibs
⭐
9
A library for generating mad lib sentences out of text corpuses, written for Art Hack Day: Deluge.
More Stoplists
⭐
8
Stoplists for African languages generated from the ASP corpus
Tf Idf
⭐
8
A rubygem that calculates the tf-idf of a corpus.
Dont_bayes_me_bro
⭐
8
Benchmarking Bayesian filtering in Ruby
Corpusbuilder
⭐
7
Corpus Build OCR platform
Parts
⭐
6
Parts is a simple-to-use probabilistic part-of-speech (POS) tagger.
Rukovsky
⭐
6
Generate poems via Markov chains.
Wwwjdic2db
⭐
5
A project for converting the kanjidic (and in future edict and Tanaka corpus) files from Jim Breen's wwwjdic project into a database format.
Poliqarpr
⭐
5
Ruby client for Poliqarp text corpus server (see http://poliqarp.sourceforge.net/)