-
Krai
- Cambridge, UK
-
23:23
(UTC) - http://uk.linkedin.com/in/lokhmotov
Stars
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions
Introduction to Machine Learning Systems
Fully open reproduction of DeepSeek-R1
ROCm / vllm
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
The best OSS video generation models, created by Genmo
A high-throughput and memory-efficient inference and serving engine for LLMs
This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transformers library) into inference-ready formats that run efficien…
A high-performance, "quantum-inspired" Fast Fourier Transform (FFT) library written in pure and safe Rust.
Qualcomm Cloud AI SDK (Platform and Apps) enable high performance deep learning inference on Qualcomm Cloud AI platforms delivering high throughput and low latency across Computer Vision, Object De…
Simple HTML5 Charts using the <canvas> tag
Qualcomm Cloud AI (QAIC) implementation of MLPerf Inference benchmarks
SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) architectures. SparseP is developed to evaluate and characteri…
krai / ck-mlperf
Forked from dividiti/ck-mlperfAutomated workflows for MLPerf, the industry-leading benchmark for evaluating performance of ML software and hardware
A collection of pre-trained, state-of-the-art models in the ONNX format
Example code and applications for machine learning on Graphcore IPUs
This repository contains the results and code for the MLPerf™ Inference v1.0 benchmark.
dividiti / ck-tensorflow
Forked from ctuning/ck-tensorflowCollective Knowledge components for TensorFlow (code, data sets, models, packages, workflows):
dividiti / ck-mlperf
Forked from ctuning/ck-mlperfCollective Knowledge repository to automate MLPerf - a broad ML benchmark suite for measuring performance of ML software frameworks, ML hardware accelerators, and ML cloud platforms:
Dev repo for power measurement for the MLPerf™ benchmarks
Code for Neural Architecture Search without Training (ICML 2021)





