Awesome Open Source

Programming Languages

Search results for clips

239 search results found

Editly ⭐ 4,303

Slick, declarative command line video editing & API

Pushdeer ⭐ 4,285

开放源码的无App推送服务，iOS14+扫码即用。亦支持快应用/iOS和Mac客户端、Android

Marqo ⭐ 3,893

Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

Mmpretrain ⭐ 3,177

OpenMMLab Pre-training Toolbox and Benchmark

Chinese Clip ⭐ 2,816

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Zero_nlp ⭐ 2,248

中文nlp解决方案(大模型、数据、模型、训练、推理)

Clip Interrogator ⭐ 2,181

Image to prompt with BLIP and CLIP

Clip Retrieval ⭐ 1,949

Easily compute clip embeddings and build a clip retrieval system with them

Rwidgethelper ⭐ 1,604

Android UI 快速开发，专治原生控件各种不服

Awesome Openai Vision Api Experiments ⭐ 1,483

Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

Bilix ⭐ 1,433

⚡️Lightning-fast async download tool for bilibili and more | 快如闪电的异步下载工具，支持bilibili及更多

Vlm_survey ⭐ 1,405

Vision-Language Models for Vision Tasks: A Survey

Hcaptcha Challenger ⭐ 1,247

🥂 Gracefully face hCaptcha challenge with MoE(ONNX) embedded solution.

Natural Language Youtube Search ⭐ 783

Search inside YouTube videos using natural language

Awesome Clip ⭐ 782

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

Aphantasia ⭐ 757

CLIP + FFT/DWT/RGB = text to image/video

Natural Language Image Search ⭐ 741

Search photos on Unsplash using natural language

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Stable Diffusion Ncnn ⭐ 701

Stable Diffusion in NCNN with c++, supported txt2img and img2img

Clip4clip ⭐ 663

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Text2live ⭐ 642

Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

Video Chatgpt ⭐ 590

"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

🧞 No-code tool for creating a neural search solution in minutes

React Truncate ⭐ 549

React component for truncating multi-line spans and adding an ellipsis.

Keras_cv_attention_models ⭐ 523

Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,effi alias kecam

Transformer Mm Explainability ⭐ 490

[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.

Zmjimageeditor ⭐ 449

ZMJImageEditor is a picture editing component like WeChat. It is powerful and easy to integrate, supporting rendering, text, rotation, tailoring, mapping and other functions. (ZMJImageEditor 是一个和微信一样图片编辑的组件，功能强大，极易集成，支持绘制、文字、旋转、剪裁、贴图等功能)

Openscene ⭐ 443

[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies

Video_features ⭐ 397

Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, ResNet features.

Skypaint Ai Diffusion ⭐ 356

基于Stable Diffusion优化的AI绘画模型。支持输入中英文文本，可生成多种现代艺术风格的高质量图像。| An optimized text-to-image model based on Stable Diffusion. Both Chinese and English text inputs are available to generate images. The model can generate high-quality images in several modern art styles.

Clip.cpp ⭐ 335

CLIP inference in plain C/C++ with no extra dependencies

Easyreveal ⭐ 329

Android Easy Reveal Library

Cliport ⭐ 297

CLIPort: What and Where Pathways for Robotic Manipulation

Autocut Client ⭐ 274

Clipstyler ⭐ 252

Official Pytorch implementation of "CLIPstyler:Image Style Transfer with a Single Text Condition" (CVPR 2022)

PASSL包含 SimCLR，MoCo v1/v2，BYOL，CLIP，PixPro，simsiam, SwAV, BEiT，MAE 等图像自监督算法以及 Vision Transformer，DEiT，Swin Transformer，CvT，T2T-ViT，MLP-Mixer，XCiT，ConvNeXt，PV 等基础视觉算法

Awesome Foundation And Multimodal Models ⭐ 223

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code]

Disco_diffusion_local ⭐ 213

Getting the latest versions of Disco Diffusion to work locally, instead of colab. Including how I run this on Windows, despite some Linux only dependencies ;)

Targetclip ⭐ 208

[ECCV 2022] Official PyTorch implementation of the paper Image-Based CLIP-Guided Essence Transfer.

Ksymediaeditorkit_android ⭐ 197

金山云短视频编辑SDK Android版本,合成速度快,支持抖动、冲击波、灵魂出窍等特效滤镜 Short video editor SDK powered by KSYUN, which makes it easy to capture, create, view, edit and share your clips and playback anywhere

Fashion Clip ⭐ 189

FashionCLIP is a CLIP-like model fine-tuned for the fashion domain.

Clip Iqa ⭐ 183

[AAAI 2023] Exploring CLIP for Assessing the Look and Feel of Images

GenSim: Generating Robotic Simulation Tasks via Large Language Models

Paddlemix ⭐ 172

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.

Flutter Shapeofview ⭐ 163

Give a custom shape to any flutter widget, Material Design 2 ready

CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)

Clipspy ⭐ 152

Python CFFI bindings for the 'C' Language Integrated Production System CLIPS

Vlmevalkit ⭐ 137

Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 30+ HF models, 15+ benchmarks

Clip Italian ⭐ 133

CLIP (Contrastive Language–Image Pre-training) for Italian

Stylegan3 Clip Notebooks ⭐ 132

A collection of Jupyter notebooks to play with NVIDIA's StyleGAN3 and OpenAI's CLIP for a text-based guided image generation.

Chinese Text Generation/中文文本生成,诗词曲联生成、歌词生成、现代诗生成、问题扩增、自动摘要、文言文翻

Clip Image Sorter ⭐ 128

Sort a folder of images according to their similarity with provided text in your browser (uses a browser-ported version of OpenAI's CLIP model and the web's new File System Access API)

Vodrecovery ⭐ 126

The purpose of this script is to obtain videos or clips that are either marked as "sub-only" or have been deleted on Twitch.

Language Models Can See: Plugging Visual Controls in Text Generation

Ru Clip ⭐ 122

CLIP implementation for Russian language

Clip Onnx ⭐ 122

It is a simple library to speed up CLIP inference up to 3x (K80 GPU)

Segment Anything Clip ⭐ 119

Connecting segment-anything's output masks with the CLIP model; Awesome-Segment-Anything-Works

Vand April Gan ⭐ 113

[CVPR 2023 Workshop] VAND Challenge: 1st Place on Zero-shot AD and 4th Place on Few-shot AD

Scaling Laws Openclip ⭐ 112

Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)

Picquery ⭐ 108

🔍 Search local images with natural language on Android, powered by OpenAI's CLIP model. 在 Android 上用自然语言搜索本地图片 (基于 OpenAI 的 CLIP 模型)

Clip Caption Reward ⭐ 104

PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)

[CVPR2023] Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective

Vision Language Models Are Bows ⭐ 95

Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023

Instruct2act ⭐ 94

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

Clip4cir ⭐ 92

[ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features

Music Id ⭐ 90

🎧 Automatically detect music running in Twitch streams. Effortlessly identifies music in real-time, making it easy for both streamers and viewers to discover new music while watching Twitch streams.

Vqgan Clip App ⭐ 90

Local image generation using VQGAN-CLIP or CLIP guided diffusion

Expert Systems for Python

Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP). Accepted at NAACL 2022.

Prompt Align ⭐ 87

[ICCV23] Prompt-aligned Gradient for Prompt Tuning

[ICCV 2023] Official implementation of "PØDA: Prompt-driven Zero-shot Domain Adaptation"

PyTorch code for MUST

Yasd Discord Bot ⭐ 83

Yet Another Stable Diffusion Discord Bot

Vip Llava ⭐ 81

ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Moleculestm ⭐ 81

Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42256-023-00759-

Twitch Downloader ⭐ 80

Download Twitch VODs and Clips

Speechclip ⭐ 80

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022

Videoclip ⭐ 79

Easily create videoclips with mpv.

Promptdet ⭐ 77

PromptDet: Towards Open-vocabulary Detection using Uncurated Images, ECCV2022

Diffusion Explainer ⭐ 77

Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion

[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion

Nadeshiko ⭐ 76

A Linux tool to cut short videos with ffmpeg.

Transformers ⭐ 75

Everything you need to know about Transformers! 🤖

Natural Language Joint Query Search ⭐ 70

Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.

ICCV 2023: CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No

Pathology Language and Image Pre-Training (PLIP) is the first vision and language foundation model for Pathology AI. PLIP is a large-scale pre-trained model that can be used to extract visual and language features from pathology images and text description. The model is a fine-tuned version of the original CLIP model.

Clip Imagesearch Ncnn ⭐ 67

CLIP⚡NCNN⚡基于自然语言的图片搜索(Image Search)⚡以字搜图⚡x86⚡Android

Cropbitmap ⭐ 66

CLIPfa: Connecting Farsi Text and Images

Obs Alternative To Shadowplay ⭐ 65

An OBS Studio Guide to Replace NVIDIA Shadowplay

Youtube Clips Automator ⭐ 64

MARCELO: an AI powered bot to automate the editing and thumbnail creation for your Youtube clips channel

Gui Youtube Dl ⭐ 64

A cross platform GUI for youtube-dl written entirely in python using the WX library.

[ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "

Design a growing artistic exhibit of your own making, with semantic search powered by OpenAI CLIP

Beyond Inet ⭐ 62

Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"

Fast_text2stylegan ⭐ 62

Official repo of Text-Free Learning of a Natural Language Interface for Pretrained Face Generators

Clip Container ⭐ 58

A containerized REST API around OpenAI's CLIP model.

Clip_surgery ⭐ 55

CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

React Image Process ⭐ 54

🎨 a image process component for react

1-100 of 239 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.