Stars
📚 Freely available programming books
Robust Speech Recognition via Large-Scale Weak Supervision
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
The world's simplest facial recognition api for Python and the command line
A generative speech model for daily dialogue.
State-of-the-art 2D and 3D Face Analysis Project
Industry leading face manipulation platform
Xiaomi Home Integration for Home Assistant
🌈Python3网络爬虫实战:淘宝、京东、网易云、B站、12306、抖音、笔趣阁、漫画小说下载、音乐电影下载等
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Android in docker solution with noVNC supported and video recording
Python version of the Playwright testing and automation library.
🎬 卡卡字幕助手 | VideoCaptioner - 基于 LLM 的智能字幕助手 - 视频字幕生成、断句、校正、字幕翻译全流程处理!- A powered tool for easy and efficient video subtitling.
Freeze (package) Python programs into stand-alone executables
A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.
Azur Lane bot (CN/EN/JP/TW) 碧蓝航线脚本 | 无缝委托科研,全自动大世界
INFO-SPIDER 是一个集众多数据源于一身的爬虫工具箱🧰,旨在安全快捷的帮助用户拿回自己的数据,工具代码开源,流程透明。支持数据源包括GitHub、QQ邮箱、网易邮箱、阿里邮箱、新浪邮箱、Hotmail邮箱、Outlook邮箱、京东、淘宝、支付宝、中国移动、中国联通、中国电信、知乎、哔哩哔哩、网易云音乐、QQ好友、QQ群、生成朋友圈相册、浏览器浏览历史、12306、博客园、CSDN博客…
PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models
Automate your mobile devices with natural language commands - an LLM agnostic mobile Agent 🤖
The most powerful Android RPA agent framework, next generation of mobile automation robots.
This repo contains source code and materials for the TEmporally COherent GAN SIGGRAPH project.
📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, MNN, PaddlePaddle and PyTorch.
Gemini polling proxy service (gemini轮询代理服务)
QD [v20240210] —— HTTP请求定时任务自动执行框架 base on HAR Editor and Tornado Server