-
Adobe
- Bangalore,India
-
00:31
(UTC -12:00)
Highlights
Stars
Mercury is a transforming drone anyone can build that can be adapted for many use cases thanks to it's versatile mobility, wide range of sensors, and cargo bay area
A group of notebooks and other files which can help you learn AI from scratch.
π RuView: WiFi DensePose turns commodity WiFi signals into real-time human pose estimation, vital sign monitoring, and presence detection — all without a single pixel of video.
This repository lists some awesome public projects about Zero-shot/Few-shot Learning based on CLIP (Contrastive Language-Image Pre-Training).
LLM Council works together to answer your hardest questions
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
Blind&Invisible Watermark ,图片盲水印,提取水印无须原图!
Demystify AI agents by building them yourself. Local LLMs, no black boxes, real understanding of function calling, memory, and ReAct patterns.
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to con…
[SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"
[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
zero-shot voice conversion & singing voice conversion, with real-time support
AI wearables. Put it on, speak, transcribe, automatically
Efficient Part-level 3D Object Generation via Dual Volume Packing
Supercharge Your LLM with the Fastest KV Cache Layer
Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model
[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
Local, open-source AI app builder for power users ✨ v0 / Lovable / Replit / Bolt alternative 🌟 Star if you like it!
The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation…
A self hosted virtual browser that runs in docker and uses WebRTC.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
