
slime is an LLM post-training framework for RL Scaling.

Python 3,699 500 Updated Feb 5, 2026

Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Python 552 50 Updated Feb 2, 2026

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 572 63 Updated Feb 6, 2026

ResearcherBench: Evaluating Deep AI Research Systems on the Frontiers of Scientific Inquiry

Python 45 5 Updated Jan 5, 2026

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 19,663 2,469 Updated Feb 6, 2026

New GUI for our SECM Device!

C 2 Updated Jan 7, 2019

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 419 15 Updated Jul 11, 2025

[NeurIPS 2025] Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning

Python 123 5 Updated Dec 13, 2025
Python 159 9 Updated Apr 17, 2025

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 697 46 Updated Oct 15, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,180 1,403 Updated Jan 21, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 19,038 3,203 Updated Feb 6, 2026

Pure-Python Server-Sent Events (SSE) client

Python 227 35 Updated Jan 2, 2026

Best practices for distilling large language models.

Jupyter Notebook 605 53 Updated Feb 1, 2024

Dromedary: towards helpful, ethical and reliable LLMs.

Python 1,143 89 Updated Sep 18, 2025

All things prompt engineering

Python 5,730 329 Updated Jun 4, 2024

A summary of Prompt & LLM papers, open-source data & models, and AIGC applications

3,346 321 Updated Jan 19, 2026

Official Code for "PEVAE: A Hierarchical VAE for Personalized Explainable Recommendation."

Python 12 1 Updated Oct 12, 2022

A library for building and serving multi-node distributed faiss indices.

Python 276 21 Updated Nov 1, 2023

State-of-the-Art Text Embeddings

Python 18,213 2,755 Updated Feb 5, 2026

☕ A tool to generate requirements.txt for Python projects, and more than that. (IT IS NOT A PACKAGE MANAGEMENT TOOL)

Python 1,780 90 Updated Jan 9, 2026

Library for Knowledge Intensive Language Tasks

Python 963 90 Updated Mar 31, 2022

Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks.

Python 1,859 316 Updated Apr 6, 2023

POT : Python Optimal Transport

Python 2,743 538 Updated Feb 5, 2026

Toolbox to integrate optimal transport loss functions using automatic differentiation and Sinkhorn's algorithm

Python 449 39 Updated May 14, 2018

New Game Start!

C++ 2 Updated Jan 4, 2020

SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining

Python 83 15 Updated Nov 17, 2021
Python 3 1 Updated Nov 4, 2022

PyTorch package for the discrete VAE used for DALL·E.

Python 10,875 1,893 Updated Jan 31, 2024

Jupyter notebook on Gumbel-max and Gumbel-softmax tricks

Jupyter Notebook 40 8 Updated Nov 11, 2022