-
Meta
- United States
Highlights
Stars
Robust Speech Recognition via Large-Scale Weak Supervision
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
A Blender script to procedurally generate 3D spaceships
Python package for the evaluation of odometry and SLAM
An all-in-one Docker image for deep learning. Contains all the popular DL frameworks (TensorFlow, Theano, Torch, Caffe, etc.)
Pointcept: Perceive the world with sparse points, a codebase for point cloud perception research. Latest works: Concerto (NeurIPS'25), Sonata (CVPR'25 Highlight), PTv3 (CVPR'24 Oral)
A library for differentiable nonlinear optimization
Cramming the training of a (BERT-type) language model into limited compute.
This toolkit was designed for the fast and efficient development of modern machine comprehension models, including both published models and original prototypes.
AttentionGAN for Unpaired Image-to-Image Translation & Multi-Domain Image-to-Image Translation
[CVPR'25 Highlight] Official repository of Sonata: Self-Supervised Learning of Reliable Point Representations
A Hearthstone: Heroes of WarCraft Simulator for the purposes of Machine Learning and Data Mining
Code for "Point-based Multi-view Stereo Network" (ICCV 2019 Oral) & "Visibility-aware Point-based Multi-view Stereo Network" (TPAMI)
[NeurIPS'25] Official repository of Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations
SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
[CVPR 2019 Oral] Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation
Official source code of "Batch DropBlock Network for Person Re-identification and Beyond" (ICCV 2019)
GL3D (Geometric Learning with 3D Reconstruction): a large-scale database created for 3D reconstruction and geometry-related learning problems
Implementation of CVPR'20 paper - ASLFeat: Learning Local Features of Accurate Shape and Localization
[TensorFlow] Official implementation of CVPR'20 oral paper - D3Feat: Joint Learning of Dense Detection and Description of 3D Local Features https://arxiv.org/abs/2003.03164
Implementation of ICCV19 Paper "Learning Two-View Correspondences and Geometry Using Order-Aware Network"
Cost Volume Pyramid Based Depth Inference for Multi-View Stereo (CVPR 2020 Oral)
Implementation of CVPR'19 paper (oral) - ContextDesc: Local Descriptor Augmentation with Cross-Modality Context
KFNet: Learning Temporal Camera Relocalization using Kalman Filtering (CVPR 2020 Oral)
[ECCV 2020] XingGAN for Person Image Generation
HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos, CVPR 2025
Implementation of ECCV'18 paper - GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints
[ACM MM 2018 Oral] GestureGAN for Hand Gesture-to-Gesture Translation in the Wild


