Awesome Open Source

Search results for mscoco

46 search results found

Swin Transformer ⭐ 12,215

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

A Pytorch Tutorial To Image Captioning ⭐ 2,084

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Ml Cvnets ⭐ 1,543

CVNets: A library for training computer vision networks

Bottom Up Attention ⭐ 979

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Hrnet Object Detection ⭐ 549

Object detection with multi-level representations generated from deep high-resolution representation learning (HRNetV2h). This is an official implementation for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919

Swin Transformer Object Detection ⭐ 513

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Pytorch Superpoint ⭐ 376

SuperPoint implemented in PyTorch: https://arxiv.org/abs/1712.07629

Edgenets ⭐ 340

This repository contains the source code of our work on designing efficient CNNs for computer vision

Adaptis ⭐ 335

[ICCV19] AdaptIS: Adaptive Instance Selection Network, https://arxiv.org/abs/1909.07829

Keypoint_communities ⭐ 266

[ICCV '21] This repository contains the code for our paper "Keypoint Communities".

This is an official implementation for "Contextual Transformer Networks for Visual Recognition".

Vitae Transformer ⭐ 187

The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"

Ow Detr ⭐ 151

[CVPR 2022] Official PyTorch code for OW-DETR: Open-world Detection Transformer

Video Platform for Action Recognition and Object Detection in PyTorch

Scene_generation ⭐ 145

A PyTorch implementation of the paper: Specifying Object Attributes and Relations in Interactive Scene Generation

Swa_object_detection ⭐ 128

SWA Object Detection

Robust Detection Benchmark ⭐ 106

Code, data and benchmark from the paper "Benchmarking Robustness in Object Detection: Autonomous Driving when Winter is Coming" (NeurIPS 2019 ML4AD)

Imagenetmodel ⭐ 101

Official ImageNet Model repository

Varifocalnet ⭐ 99

VarifocalNet: An IoU-aware Dense Object Detector

Coco Assistant ⭐ 88

Helper for dealing with MS-COCO annotations

Hrnet Fcos ⭐ 82

High-resolution Networks for the Fully Convolutional One-Stage Object Detection (FCOS) algorithm

Multiple Objects Gan ⭐ 81

Implementation for "Generating Multiple Objects at Spatially Distinct Locations" (ICLR 2019)

Bmaskr Cnn ⭐ 81

Boundary-preserving Mask R-CNN (ECCV 2020)

Semantic Propositional Image Caption Evaluation

Semantic Object Accuracy For Generative Text To Image Synthesis ⭐ 74

Code for "Semantic Object Accuracy for Generative Text-to-Image Synthesis" (TPAMI 2020)

Mobilenetv3_centernet ⭐ 49

A TensorFlow implementation of MobileNetV3 CenterNet that can be easily deployed on Android (MNN) and iOS (Core ML).

Image_captioning ⭐ 40

Generate captions for images using a CNN-RNN model trained on the Microsoft Common Objects in COntext (MS COCO) dataset

Coco Caption ⭐ 39

Adds the SPICE metric to the coco-caption evaluation server code

Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition

Labelformat ⭐ 36

A tool for converting computer vision label formats.

A repository to support the development of a repository and interchange format for weed identification annotation

Visually Informed Embedding Of Word View ⭐ 28

Visually informed embedding of word (VIEW) is a tool for transferring multimodal background knowledge to NLP algorithms.

Shelfnet Human Pose Estimation ⭐ 26

Fast and accurate Human Pose Estimation using ShelfNet with PyTorch

Consistency ⭐ 19

Implementation of models in our EMNLP 2019 paper: A Logic-Driven Framework for Consistency of Neural Models

Videoyolo ⭐ 17

Object Detection for Video with MXNet and GluonCV using YOLOv3

Official implementation of "Max Pooling with Vision Transformers reconciles class and shape in weakly supervised semantic segmentation"

Deepmask ⭐ 14

A Keras implementation of DeepMask based on NIPS 2015 paper "Learning to Segment Object Candidates"

Deep Learning ⭐ 14

Side projects and hands-on work

Image To Coco Json Converter ⭐ 13

Convert segmentation mask images to COCO JSON format

Ladderloss ⭐ 9

Ladder Loss for Coherent Visual-Semantic Embedding, AAAI, 2020

Jakarnotator ⭐ 7

Jakarnotator is an annotation tool for creating your own database for instance segmentation problems.

Digivision ⭐ 7

A deep-learning-based application intended to help visually impaired people. The application automatically generates a textual description of what is happening in front of the camera and conveys it to the user through audio. It can also recognise faces and tell the user whether a known person is standing in front of them.

We aim to generate realistic images from text descriptions using a GAN architecture. The network we designed is used for image generation on two datasets: MSCOCO and CUBS.

Chitra Varnan ⭐ 5

Hindi Image Captioning

Yolov4 Pytorch ⭐ 5

Implementation of Darknet with You Only Look Once (YOLO) in PyTorch

Civic_issue_dataset ⭐ 5

Civic Issue Detection Dataset from Adversarial Adaptation of Scene Graph Models for Understanding Civic Issues

Copyright 2018-2024 Awesome Open Source. All rights reserved.