Stars
Tesseract Open Source OCR Engine (main repository)
Reference PyTorch implementation and models for DINOv3
The simplest, fastest repository for training/finetuning small-sized VLMs.
This program is implemented to count the number of cells in the image. The cells are also labeled and the perimeter and area are calculated for each cell.
Count-Ception: Counting by Fully Convolutional Redundant Counting
Scalable Instance Segmentation using PyTorch & PyTorch Lightning.
A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience
An open-source AI agent that brings the power of Gemini directly into your terminal.
A collection of guides and examples for the Gemma open models from Google.
Local Deep Research achieves ~95% on SimpleQA benchmark (tested with GPT-4.1-mini). Supports local and cloud LLMs (Ollama, Google, Anthropic, ...). Searches 10+ sources - arXiv, PubMed, web, and yo…
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, dif…
NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Solve Visual Understanding with Reinforced VLMs
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
Integrate the DeepSeek API into popular softwares
#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere
Agentic components of the Llama Stack APIs
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
✔(已完结)超级全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】【大飞 大模型Agent】
Effortless data labeling with AI support from Segment Anything and other awesome models.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Wanna know what your model sees? Here's a package for applying EigenCAM and generating heatmap from the new YOLO V12 model
最新公务员考试、公考资料免费分享,涵盖申论范文、申论真题库、公考面试真题、申论素材等,助你轻松备考!
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Model Context Protocol Servers