-
DatologyAI
- San Francisco
- https://www.anshumansuri.com/
- @iamgroot42
Highlights
-
-
datatrove Public
Forked from huggingface/datatroveFreeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
-
TAP Public
Forked from RICommunity/TAPTAP: An automated jailbreaking method for black-box LLMs
-
gpu-dashboard Public
Forked from johnmath/gpu-dashboardMonitoring lab machines
-
nanoGCG Public
Forked from GraySwanAI/nanoGCGA fast + lightweight implementation of the GCG algorithm in PyTorch
-
-
-
smolagents Public
Forked from huggingface/smolagents🤗 smolagents: a barebones library for agents that think in code.
-
cy4100_tools_and_agents Public
Code for 'Tools and Agents' lecture for CY4100
-
assignment1-basics Public
Forked from stanford-cs336/assignment1-basicsStudent version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
-
KittenTTS Public
Forked from KittenML/KittenTTSState-of-the-art TTS model under 25MB 😻
-
easy-dataset-share Public
Forked from Responsible-Dataset-Sharing/easy-dataset-shareA CLI tool that helps AI researchers share datasets responsibly.
-
assignment4-data Public
Forked from stanford-cs336/assignment4-dataPython MIT License UpdatedJul 21, 2025 -
mimir Public
Python package for measuring memorization in LLMs.
-
stamp Public
Forked from codeboy5/stampCode for the Paper: "STAMP Your Content: Proving Dataset Membership via Watermarked Rephrasings"
-
FlexOlmo Public
Forked from allenai/FlexOlmoCode and training scripts for FlexOlmo
-
An awesome list of papers on distribution/property inference in machine learning
-
FlagEmbedding Public
Forked from FlagOpen/FlagEmbeddingRetrieval and Retrieval-augmented LLMs
-
mteb Public
Forked from embeddings-benchmark/mtebMTEB: Massive Text Embedding Benchmark
-
results Public
Forked from embeddings-benchmark/resultsData for the MTEB leaderboard
-
llm-adaptive-attacks Public
Forked from tml-epfl/llm-adaptive-attacksJailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]
-
MIDSTModels Public
Forked from VectorInstitute/MIDSTModelsReference implementations for the MIDST challenge (Membership Inference over Diffusion-models-based Synthetic Tabular data) - SaTML 2025!
-
-
jailbreak-objectives Public
Forked from facebookresearch/jailbreak-objectivesCode and data to go with the Zhu et al. paper "An Objective for Nuanced LLM Jailbreaks"
-
iha_hild Public
Code for our paper 'Do Parameters Reveal More than Loss for Membership Inference?'
-
-
-
contrastors Public
Forked from nomic-ai/contrastorsTrain Models Contrastively in Pytorch
Python Apache License 2.0 UpdatedOct 18, 2024 -
MAZE Public
Forked from sanjaykariyappa/MAZEImplementation of the paper "MAZE: Data-Free Model Stealing Attack Using Zeroth-Order Gradient Estimation".
-





