Textmining

In this project, there are two major tasks: text data processing and text categorization. In text data processing, we have done tokenization, stemming, normalization, etc. Also, vector space model and statistical language models are used to retrieve similar documents to query. In text categorization, we build a text classification system which includes feature selection, classifiers (Naive Bayes and K Nearest Neighbor using brute force and random vectors), cross validation, and parameter tuning.

Categories > Computer Science > Vector

Suggest Alternative

Stars

License

No license specified

Open Issues

Most Recent Commit

8 years ago

Programming Language

Python

Categories

Programming Languages > Python

Computer Science > Vector

Security > Brute Force

Data Processing > Data Processing

Repo

Suggest An Alternative To TextMining

Popular Vector Projects

Supabase ⭐ 62,208

The open source Firebase alternative.

dependent packages 2total releases 36latest release March 16, 2020most recent commit 5 months ago

Meilisearch ⭐ 43,646

A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

most recent commit a month ago

Quivr ⭐ 27,485

Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation.

most recent commit 5 months ago

Milvus ⭐ 25,219

A cloud-native vector database, storage for next generation AI applications

total releases 35latest release April 02, 2022most recent commit 5 months ago

Vector ⭐ 16,326

A high-performance observability data pipeline.

total releases 3latest release March 13, 2021most recent commit 3 months ago

Popular Brute Force Projects

Routersploit ⭐ 11,910

Exploitation Framework for Embedded Devices

most recent commit 2 months ago

Thc Hydra ⭐ 8,480

hydra

most recent commit 7 months ago

K8tools ⭐ 5,502

K8工具合集(内网渗透/提权工具/远程溢出/漏洞利用/扫描工具/密码破解/免杀工具/Exploit/ Web GetShell Exploit(Struts2/Zimbra/Weblogic/Tomcat/Apache/Jbos

most recent commit 6 months ago

Scan4all ⭐ 5,343

Official repository vuls Scan: 15000+PoCs; 23 kinds of application password crack; 7000+Web fingerprints; 146 protocols and 90000+ rules Port scanning; Fuzz, HW, awesome BugBounty( ͡° ͜ʖ ͡°)...

most recent commit 3 months ago

Ladon ⭐ 4,564

Ladon大型内网渗透工具，可PowerShell模块化、可CS插件化、可内存加载，无文件扫描。含端 12.2内置262个功能,网络资产探测模块32个通过多种协议(ICMP\NBT\DNS\MAC\SM

total releases 8latest release June 05, 2023most recent commit 6 months ago

Popular Computer Science Categories

Get A Weekly Email With Trending Projects For These Categories

No Spam. Unsubscribe easily at any time.

Python

Vector

Brute Force

Data Processing

Privacy | About | Terms | Follow Us On Twitter

Downloads, Dependent Repos, Dependent Packages, Total Releases, Latest Releases data powered by Libraries.io.