- bytedance
- beijing
Stars
Awesome AI Memory | LLM Memory | A curated knowledge base on AI memory for LLMs and agents, covering long-term memory, reasoning, retrieval, and memory-native system design. Awesome-AI-Memory is a …
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
The Hugging Face Course on Transformers for Audio
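A Python module that extracts the province, city, and district from Simplified Chinese strings and supports mapping, validation, and simple plotting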
A series of large language models developed by Baichuan Intelligent Technology
Phase 2 of the Chinese LLaMA-2 & Alpaca-2 LLM project, plus 64K long-context models
Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment
A PyTorch-based library for semi-supervised learning (NeurIPS'21)
PyTorch implementation of SwAV https://arxiv.org/abs/2006.09882
Deep Learning Visualization Toolkit (the PaddlePaddle deep learning visualization tool)
This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 2020
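Fully commented in Chinese. The RetinaNet loss function, implemented in PyTorch. You can use it in one-stage detection or classification tasks to mitigate the effect of data imbalance. It is used in one-stage object detection algorithms to improve detection performance, and you can also use it in classification tasks…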
label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
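A deep-learning-based tumor diagnosis assistance system centered on image segmentation. It uses AI to identify and delineate tumor regions and provides features of those regions to assist doctors in diagnosis. Includes complete model building, backend setup, industrial-grade deployment, and frontend access. TensorRT, PyTorch, OpenCV, Flask, Vue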
PyTorch implementation of EfficientNetV2 family
[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
PyTorch-style and human-readable RegNet with a spectrum of pre-trained models
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Python bindings for FFmpeg - with complex filtering support
Yet another easy-to-use tool to extract frames from videos, for deep learning and computer vision.
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
Code for Neural Motifs: Scene Graph Parsing with Global Context (CVPR 2018)
A Hierarchical Approach for Generating Descriptive Image Paragraphs
Algorithm practice is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.
