Storefront
- Osaka
Stars
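超能文献 | AI-driven document translation and academic search service. Supports high-quality translation of PDF, DOCX, PPTX, and other document formats (11 languages supported), with translation of mathematical formulas specially optimized. Also provides intelligent search of PubMed academic literature. More at: https://suppr.wilddata.cn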
Quickly preview, jump to, and favorite any message in your AI chats
Official code for TimeCraft: A Time Series Generation Framework for Real-World Applications
Nexent is a zero-code platform for auto-generating agents — no orchestration, no complex drag-and-drop required. Nexent also offers powerful capabilities for agent running control, data processing …
Enterprise-grade, commercial-friendly agentic workflow platform for building next-generation SuperAgents.
✨ WithAnyone is capable of generating high-quality, controllable, and ID-consistent images
Discrete Diffusion Divergence Instruct (DiDi-Instruct)
Official implementation of the MM25 paper: DeflareMamba: Hierarchical Vision Mamba for Contextually Consistent Lens Flare Removal
CAVER: Cross-Modal View-Mixed Transformer for Bi-Modal Salient Object Detection
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better
Arthas MCP Server is an MCP-based diagnostic toolkit for Java applications, designed for LLM integration. It integrates with Alibaba Arthas so AI assistants can analyze and diagnose Java apps.
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
A PyTorch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures", "Learning to count object", "Bottom-up top-down" for Visu…
Neural image captioning (NIC) implementation with Keras 2.
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation
(ECCVW 2025) GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.
Multilingual Document Layout Parsing in a Single Vision-Language Model