Lists (3)
Sort Name ascending (A-Z)
Stars
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero
21 Lessons, Get Started Building with Generative AI
微舆:人人可用的多Agent舆情分析助手,打破信息茧房,还原舆情原貌,预测未来走向,辅助决策!从0实现,不依赖任何框架。
Lucene++ is an up to date C++ port of the popular Java Lucene library, a high-performance, full-featured text search engine.
Recognize font from image using DeepFont technique.
【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
这是一个为大模型提供 A 股数据的的 MCP(Model Content Protocol) 服务。
2025年 qt 开发最新总结,提供全面的 qt 开发学习资源,涵盖从基础知识到实战项目的资料、文献、书籍、项目和示例,帮助你快速入门并逐步进阶,持续更新维护中!
The official Open-Asset-Importer-Library Repository. Loads 40+ 3D-file-formats into one unified and clean data structure.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
portable Time Stamp Server (over HTTP)
PaddlePaddle Code Convert Toolkit. 『飞桨』深度学习代码转换工具
QtNetwork Next Generation. A coroutine based network framework for Qt/C++, with more simpler API than boost::asio.
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
Official Repository of ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning
Get your documents ready for gen AI
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
Convert PDF to markdown + JSON quickly with high accuracy
整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX | Organize the currently open-source optimal table recognition models, improve pre-processing and post-processing, and convert the models to ONNX.
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Interactive roadmaps, guides and other educational content to help developers grow in their careers.
