Awesome Open Source

Programming Languages

Search results for python video understanding

video-understanding x

86 search results found

Mmaction2 ⭐ 3,647

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Ask Anything ⭐ 2,404

[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Mmaction ⭐ 1,415

An open-source toolbox for action understanding based on PyTorch

Paddlevideo ⭐ 1,355

Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video tagging and sport action detection.

Temporal Segment Networks ⭐ 1,235

Code & Models for Temporal Segment Networks (TSN) in ECCV 2016

Videomae ⭐ 864

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Tsn Pytorch ⭐ 751

Temporal Segment Networks (TSN) in PyTorch

Internvideo ⭐ 736

InternVideo: General Video Foundation Models via Generative and Discriminative Learning (https://arxiv.org/abs/2212.03191)

Action Detection ⭐ 551

temporal action detection with SSN

[ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions

Chat Univi ⭐ 382

Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition

End-to-End Learning of Motion Representation for Video Understanding

Multiverse ⭐ 222

Dataset, code and model for the CVPR'20 paper "The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction". And for the ECCV'20 SimAug paper.

Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral

Actionvlad ⭐ 201

ActionVLAD for video action classification (CVPR 2017)

Tadaconv ⭐ 177

[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.

Youtube 8m ⭐ 166

The 2nd place Solution to the Youtube-8M Video Understanding Challenge by Team Monkeytyping (based on tensorflow)

deep learning sex position classifier

Text4vis ⭐ 149

【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective

Cap4video ⭐ 149

【CVPR'2023 Highlight】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?

Object_level_visual_reasoning ⭐ 148

Pytorch Implementation of "Object level Visual Reasoning in Videos", F. Baradel, N. Neverova, C. Wolf, J. Mille, G. Mori , ECCV 2018

Video2tfrecord ⭐ 142

Easily convert RGB video data (e.g. .avi) to the TensorFlow tfrecords file format for training e.g. a NN in TensorFlow. This implementation allows to limit the number of frames per video to be stored in the tfrecords.

CVPR2019 STEP: Spatio-Temporal Progressive Learning for Video Action Detection

Videomaev2 ⭐ 134

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models

Tubedetr ⭐ 127

[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers

[CVPR 2020] Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation (PyTorch)

Frozenbilm ⭐ 120

[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

I3d_finetune ⭐ 104

TensorFlow code for finetuning I3D model on UCF101.

[ICCV 2021 Oral] Deep Evidential Action Recognition

【AAAI 2021】MVFNet: Multi-View Fusion Network for Efficient Video Recognition

Motionsqueeze ⭐ 92

Official PyTorch Implementation of MotionSqueeze, ECCV 2020

Pyanomaly ⭐ 92

Useful Toolbox for Anomaly Detection

S3d.pytorch ⭐ 89

Spatiotemporal-separable 3D convolution network.

Mmpd_rppg_dataset ⭐ 75

MMPD: Multi-Domain Mobile Video Physiology Dataset(EMBC2023 Oral)

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)

Efficient 3D Backbone Network for Temporal Modeling

[AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding

CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning

[ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "

Din Group Activity Recognition Benchmark ⭐ 53

[ICCV 2021] A new codebase containing various methods for Group Activity Recognition. Paper title: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition.

Graph_distillation ⭐ 51

Graph Distillation for Action Detection

Temporally Language Grounding ⭐ 49

A Pytorch implemention for some state-of-the-art models for" Temporally Language Grounding in Untrimmed Videos"

[NeurIPS 2019] Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition

Icme2019 Ctr ⭐ 47

The Code for ICME2019 Grand Challenge: Short Video Understanding (Single Model Ranks 6th)

Temporal Shift Module ⭐ 46

Unofficial implementation for paper `Temporal Shift Module for Efficient Video Understanding`

Code release for "Training a Large Video Model on a Single Machine in a Day"

I3d Tensorflow ⭐ 45

Inflated 3D ConvNets for video understanding

Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding"

What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment

[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)

Pointtad ⭐ 30

[NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points

[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding

Glimpse_clouds ⭐ 28

Pytorch implementation of the paper "Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points", F. Baradel, C. Wolf, J. Mille , G.W. Taylor, CVPR 2018

[Codes of CVPR'21 paper] CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning

Pi Consistency Activity Detection ⭐ 26

End-to-End Semi-Supervised Learning for Video Action Detection [CVPR 2022]

STCNet: Spatio-Temporal Cross Network for Industrial Smoke Detection

Rdn4depth ⭐ 22

Region Deformer Networks for Unsupervised Depth Estimation from Unconstrained Monocular Videos, IJCAI 2019

Graph learning framework for long-term video understanding

[IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning

[IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition

Progressive Action Prediction ⭐ 18

[CVPR 2023] Code for action prediction from videos

Soccerdb ⭐ 17

SoccerDB: A Large-Scale Database for Comprehensive Video Understanding

[ECCV 2022] Official Pytorch Implementation of the paper : " Semi-Supervised Temporal Action Detection with Proposal-Free Masking "

Fitness Aqa ⭐ 16

Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]

Ltcontext ⭐ 16

[ICCV 2023] How Much Temporal Long-Term Context is Needed for Action Segmentation?

Tem Adapter ⭐ 15

[ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer

Image Pretraining For Video ⭐ 15

[ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recognition".

Videomae Action Detection ⭐ 13

[NeurIPS 2022 Spotlight] VideoMAE for Action Detection

Orbit 2022 Winner Method ⭐ 13

Few-Shot Video Object Recognition with Embedding Adaptation and Uniform Clip Sampling: Winner of ORBIT Few-Shot Object Recognition Challenge 2022

Cp 360 Weakly Supervised Saliency ⭐ 13

CP-360-Weakly-Supervised-Saliency

Region Based Non Local Network ⭐ 11

[Codes of paper]: Region-based Non-local operation for Video Classification

Piano Skills Assessment ⭐ 10

Piano Skills Assessment [IEEE MMSP 2021]

Deepepisodicmemory ⭐ 10

Deep neural network architecture for representing robot experiences in an episodic-like memory which facilitates encoding, recalling, and predicting action experience - Research Project at KIT's High Performance Humanoids Technologies Lab (H2T)

Cvpr2023 Cmpae ⭐ 10

[CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception

C3d Lstm Pytorch ⭐ 9

C3D-LSTM implementation in PyTorch

VTC: Improving Video-Text Retrieval with User Comments

Revisiting Spatial Temporal Layouts ⭐ 8

Codebase for "Revisiting spatio-temporal layouts for compositional action recognition" (Oral at BMVC 2021).

Official Implementation for "Fast Weakly Supervised Action Segmentation Using Mutual Consistency" - TPAMI 2021

[ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "

Verb_ambiguity ⭐ 8

Official implementation of "An Action Is Worth Multiple Words: Handling Ambiguity in Action Recognition", BMVC 2022

Pan Pytorch ⭐ 7

[Codes of paper]: PAN: Towards Fast Action Recognition via Learning Persistence of Appearance

Improving Transfer Learning with a Dual Image and Video Transformer for Multi-label Movie Trailer Genre Classification

Related Searches

Python Django (28,897)

Python Machine Learning (20,195)

Python Deep Learning (17,861)

Python Flask (17,643)

Python Pytorch (14,858)

Python Dataset (14,792)

Python Tensorflow (13,990)

Python Docker (13,757)

Python Command Line (13,351)

Python Jupyter Notebook (12,976)

1-86 of 86 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.