Skip to content
View Bigwhitepear's full-sized avatar
🌍
studying
🌍
studying

Block or report Bigwhitepear

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
78 stars written in Python
Clear filter

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,486 32,895 Updated Apr 16, 2026

The agent engineering platform

Python 133,790 22,112 Updated Apr 16, 2026

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 80,041 15,168 Updated May 10, 2024

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 70,438 8,385 Updated Jan 25, 2026

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,656 5,144 Updated Apr 16, 2026

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification

Python 36,261 10,899 Updated Nov 15, 2025

deep learning for image processing including classification and object-detection etc.

Python 26,202 8,243 Updated Jan 1, 2026

newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:

Python 15,022 2,126 Updated Apr 16, 2026

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 14,168 3,022 Updated Mar 19, 2026

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

Python 12,935 3,050 Updated Dec 17, 2025

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 11,485 1,837 Updated Apr 15, 2026

Open source annotation tool for machine learning practitioners.

Python 10,623 1,834 Updated Apr 14, 2026

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Python 10,202 1,390 Updated Jul 15, 2025

A PyTorch implementation of EfficientNet

Python 8,220 1,539 Updated Apr 8, 2022

总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力

Python 7,485 1,200 Updated Aug 24, 2022

Implementation of Graph Convolutional Networks in TensorFlow

Python 7,371 2,010 Updated Apr 14, 2023

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

Python 6,703 986 Updated Nov 5, 2022

pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。

Python 6,430 1,161 Updated Jan 12, 2026

Code release for ConvNeXt model

Python 6,354 738 Updated Jan 8, 2023

A treasure chest for visual classification and recognition powered by PaddlePaddle

Python 5,804 1,197 Updated Apr 1, 2026

中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。

Python 5,729 1,269 Updated Sep 23, 2020

keras implement of transformers for humans

Python 5,420 922 Updated Nov 11, 2024

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,991 1,348 Updated Mar 18, 2026

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Python 4,962 426 Updated Feb 14, 2026

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,251 545 Updated Feb 6, 2026

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Python 4,142 381 Updated Aug 13, 2024

An Open-Source Package for Knowledge Embedding (KE)

Python 4,034 986 Updated Jan 10, 2024

A deep learning library for video understanding research.

Python 3,555 434 Updated Jan 12, 2026

Graph Attention Networks (https://arxiv.org/abs/1710.10903)

Python 3,515 675 Updated Apr 9, 2022

Pytorch implementation of the Graph Attention Network model by Veličković et. al (2017, https://arxiv.org/abs/1710.10903)

Python 3,120 702 Updated Jul 6, 2023
Next