Stars
- All languages
- ApacheConf
- Arc
- C
- C#
- C++
- CSS
- CartoCSS
- Clojure
- CoffeeScript
- Common Lisp
- Cuda
- Cython
- DIGITAL Command Language
- Emacs Lisp
- Erlang
- Go
- Groovy
- HTML
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- Lua
- MATLAB
- MDX
- Makefile
- Objective-C
- OpenEdge ABL
- PHP
- PLpgSQL
- Perl
- Python
- QML
- R
- Racket
- Roff
- Ruby
- Rust
- SCSS
- Scala
- Scheme
- Shell
- Svelte
- TSQL
- Tcl
- TeX
- Twig
- TypeScript
- VHDL
- Vim Script
- XSLT
Visually explore, understand, and present your data.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK stack
Pretrained language model with 100B parameters
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
A library for efficient similarity search and clustering of dense vectors.
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Stable Diffusion with Core ML on Apple Silicon
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …
A latent text-to-image diffusion model
Bumble's Private Detector - a pretrained model for detecting lewd images
📊 Open source visualization dashboards for OpenSearch.
Carrot2: Text Clustering Algorithms and Applications
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
A python library for decision tree visualization and model interpretation.
Sequential model-based optimization with a `scipy.optimize` interface
Web-scale retrieval for knowledge-intensive NLP
Resources for tackling record linkage / deduplication / data matching problems
Code to compute permutation and drop-column importances in Python scikit-learn models
CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
STUMPY is a powerful and scalable Python library for modern time series analysis
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.