Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 72,549 9,996 Updated Mar 18, 2026

xtekky / gpt4free

The official gpt4free repository | various collection of powerful language models | opus 4.6 gpt 5.3 kimi 2.5 deepseek v3.2 gemini 3

Python 65,774 13,648 Updated Mar 13, 2026

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 59,530 9,423 Updated Mar 9, 2026

ultralytics / yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 57,044 17,442 Updated Mar 16, 2026

docling-project / docling

Get your documents ready for gen AI

Python 56,012 3,792 Updated Mar 17, 2026

microsoft / autogen

A programming framework for agentic AI

Python 55,833 8,413 Updated Mar 14, 2026

unslothai / unsloth

Unified web UI for training and running open models like Qwen, DeepSeek, and Gemma locally.

Python 55,380 4,653 Updated Mar 18, 2026

FoundationAgents / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 55,342 9,683 Updated Feb 11, 2026

crewAIInc / crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 46,446 6,267 Updated Mar 18, 2026

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,844 6,005 Updated Aug 16, 2024

mingrammer / diagrams

🎨 Diagram as Code for prototyping cloud system architectures

Python 42,084 2,724 Updated Feb 7, 2026

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 36,111 4,033 Updated Apr 19, 2025

google / langextract

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 34,778 2,335 Updated Feb 25, 2026

Pythagora-io / gpt-pilot

The first real AI developer

Python 33,804 3,501 Updated Nov 10, 2025

dgtlmoon / changedetection.io

Best and simplest tool for website change detection, web page monitoring, and website change alerts. Perfect for tracking content changes, price drops, restock alerts, and website defacement monito…

Python 30,704 1,723 Updated Mar 17, 2026

feder-cr / Jobs_Applier_AI_Agent_AIHawk

AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.

Python 29,470 4,488 Updated Nov 16, 2025

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 28,712 2,912 Updated Apr 30, 2025

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,581 2,749 Updated Aug 12, 2024

OpenBMB / MiniCPM-o

A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 24,144 1,874 Updated Mar 7, 2026

resemble-ai / chatterbox

SoTA open-source TTS

Python 23,696 3,138 Updated Mar 18, 2026

deepseek-ai / DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Python 22,938 2,742 Updated Nov 11, 2025

Skyvern-AI / skyvern

Automate browser based workflows with AI

Python 20,848 1,852 Updated Mar 18, 2026

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 19,201 1,683 Updated Nov 19, 2025

cheahjs / free-llm-api-resources

A list of free LLM inference resources accessible via API.

Python 16,205 1,613 Updated Mar 10, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chanakya Hosamani godsofheaven

Achievements