Awesome Open Source
Search results for mscoco
46 search results found
Swin Transformer
⭐
12,215
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
A Pytorch Tutorial To Image Captioning
⭐
2,084
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Ml Cvnets
⭐
1,543
CVNets: A library for training computer vision networks
Bottom Up Attention
⭐
979
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Hrnet Object Detection
⭐
549
Object detection with multi-level representations generated from deep high-resolution representation learning (HRNetV2h). This is an official implementation for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919
Swin Transformer Object Detection
⭐
513
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
Pytorch Superpoint
⭐
376
Superpoint Implemented in PyTorch: https://arxiv.org/abs/1712.07629
Edgenets
⭐
340
This repository contains the source code of our work on designing efficient CNNs for computer vision
Adaptis
⭐
335
[ICCV19] AdaptIS: Adaptive Instance Selection Network, https://arxiv.org/abs/1909.07829
Keypoint_communities
⭐
266
[ICCV '21] This repository contains the code for our paper "Keypoint Communities".
Cotnet
⭐
201
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
Vitae Transformer
⭐
187
The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"
Ow Detr
⭐
151
[CVPR 2022] Official PyTorch code for OW-DETR: Open-world Detection Transformer
Vip
⭐
146
Video Platform for Action Recognition and Object Detection in PyTorch
Scene_generation
⭐
145
A PyTorch implementation of the paper: Specifying Object Attributes and Relations in Interactive Scene Generation
Swa_object_detection
⭐
128
SWA Object Detection
Robust Detection Benchmark
⭐
106
Code, data and benchmark from the paper "Benchmarking Robustness in Object Detection: Autonomous Driving when Winter is Coming" (NeurIPS 2019 ML4AD)
Imagenetmodel
⭐
101
Official ImageNet Model repository
Varifocalnet
⭐
99
VarifocalNet: An IoU-aware Dense Object Detector
Coco Assistant
⭐
88
Helper for dealing with MS-COCO annotations
Hrnet Fcos
⭐
82
High-resolution Networks for the Fully Convolutional One-Stage Object Detection (FCOS) algorithm
Multiple Objects Gan
⭐
81
Implementation for "Generating Multiple Objects at Spatially Distinct Locations" (ICLR 2019)
Bmaskr Cnn
⭐
81
Boundary-preserving Mask R-CNN (ECCV 2020)
Spice
⭐
78
Semantic Propositional Image Caption Evaluation
Semantic Object Accuracy For Generative Text To Image Synthesis
⭐
74
Code for "Semantic Object Accuracy for Generative Text-to-Image Synthesis" (TPAMI 2020)
Mobilenetv3_centernet
⭐
49
A TensorFlow implementation of MobileNetV3 CenterNet, which can be easily deployed on Android (MNN) and iOS (CoreML).
Image_captioning
⭐
40
Generate captions for images using a CNN-RNN model trained on the Microsoft Common Objects in COntext (MS COCO) dataset
Coco Caption
⭐
39
Adds the SPICE metric to the coco-caption evaluation server code
Mcar
⭐
37
Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition
Labelformat
⭐
36
A tool for converting computer vision label formats.
Weed Ai
⭐
33
A project supporting the development of a repository and interchange format for weed identification annotations
Visually Informed Embedding Of Word View
⭐
28
Visually informed embedding of word (VIEW) is a tool for transferring multimodal background knowledge to NLP algorithms.
Shelfnet Human Pose Estimation
⭐
26
Fast and accurate Human Pose Estimation using ShelfNet with PyTorch
Consistency
⭐
19
Implementation of models in our EMNLP 2019 paper: A Logic-Driven Framework for Consistency of Neural Models
Videoyolo
⭐
17
Object Detection for Video with MXNet and GluonCV using YOLOv3
Vit Pcm
⭐
14
Official implementation of "Max Pooling with Vision Transformers reconciles class and shape in weakly supervised semantic segmentation"
Deepmask
⭐
14
A Keras implementation of DeepMask based on NIPS 2015 paper "Learning to Segment Object Candidates"
Deep Learning
⭐
14
Side projects and hands-on work
Image To Coco Json Converter
⭐
13
Convert segmentation mask images to COCO JSON format
Ladderloss
⭐
9
Ladder Loss for Coherent Visual-Semantic Embedding, AAAI, 2020
Jakarnotator
⭐
7
The Jakarnotator is an annotation tool for creating your own database for instance segmentation problems.
Digivision
⭐
7
A deep-learning-based application designed to help visually impaired people. It automatically generates a textual description of what is happening in front of the camera and conveys it to the user through audio. It can also recognise faces and tell the user whether a known person is standing in front of them.
Gan
⭐
6
We aim to generate realistic images from text descriptions using a GAN architecture. The network we designed is used for image generation on two datasets: MSCOCO and CUBS.
Chitra Varnan
⭐
5
Hindi Image Captioning
Yolov4 Pytorch
⭐
5
Implementation of Darknet with You Only Look Once (YOLO) in PyTorch
Civic_issue_dataset
⭐
5
Civic Issue Detection Dataset from Adversarial Adaptation of Scene Graph Models for Understanding Civic Issues
Copyright 2018-2024 Awesome Open Source. All rights reserved.