Skip to content
View poussa's full-sized avatar

Organizations

@opea-project

Block or report poussa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

helm charts for deploying models with llm-d

Go Template 26 46 Updated Jan 27, 2026

ATP Tennis Rankings, Results, and Stats

1,430 679 Updated Dec 30, 2024

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,413 298 Updated Jan 28, 2026

llm-d benchmark scripts and tooling

Jupyter Notebook 42 43 Updated Jan 27, 2026

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 825 119 Updated Jan 28, 2026

llm-d helm charts and deployment examples

Shell 48 53 Updated Dec 13, 2025

Gateway API Inference Extension

Go 573 228 Updated Jan 28, 2026

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python 2,574 295 Updated Jan 28, 2026

AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.

Go 1,135 125 Updated Jan 27, 2026

Intel® AI for Enterprise RAG converts enterprise data into actionable insights with excellent TCO. Utilizing Intel Gaudi AI accelerators and Intel Xeon processors ensuring streamlined deployment.

Python 47 22 Updated Jan 27, 2026

Workload Services Framework (WSF) is a benchmarking framework on Intel(R) Xeon(R) Platforms

Shell 59 54 Updated Jan 26, 2026

Terraform provider for Keycloak

Go 880 389 Updated Jan 26, 2026

GenAI Studio is a low code platform to enable users to construct, evaluate, and benchmark GenAI applications. The platform also provide capability to export developed application as a ready-to-depl…

JavaScript 58 26 Updated Jan 15, 2026

Containerization and cloud native suite for OPEA

Go 74 98 Updated Jan 5, 2026

A repository that deploys Coder OSS entirely from TF

HCL 175 52 Updated Feb 1, 2023

AWS EKS - kubernetes project and terraform module

HCL 329 169 Updated Oct 17, 2025

Karpenter is a Kubernetes Node Autoscaler built for flexibility, performance, and simplicity.

Go 1,656 420 Updated Jan 27, 2026

Collection of Intel device plugins for Kubernetes

Go 118 216 Updated Jan 28, 2026

EKS Node Viewer

Go 1,580 143 Updated Jan 26, 2026

A collection of community maintained NRI plugins

Go 100 31 Updated Jan 28, 2026

An End-to-End Distributed and Scalable Cloud KMS (Key Management System) built on top of Intel SGX enclave-based HSM (Hardware Security Module), aka eHSM.

C++ 168 54 Updated Jul 25, 2024
Go Template 23 31 Updated Jan 28, 2026

Node Resource Interface

Go 356 86 Updated Jan 28, 2026

Production-Grade Container Scheduling and Management

Go 120,124 42,310 Updated Jan 28, 2026

This repo follows the SDS extension standard of Envoy and implements an external SDS server via more secure solution which is known as Hardware Security Module(HSM). By using this repo, User can m…

Go 6 6 Updated Apr 2, 2024

Intel QuickAssist Technology( QAT) OpenSSL Engine (an OpenSSL Plug-In Engine) which provides cryptographic acceleration for both hardware and optimized software using Intel QuickAssist Technology e…

C 437 136 Updated Aug 20, 2025
Mustache 17 16 Updated Dec 11, 2025
Go 11 14 Updated Nov 20, 2024
Next