hub

CodeZen

CodeZen

一个专注中文区的 GitHub 项目发现

avatar

mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

apple-silicon audio-processing mlx multimodal speech-recognition
star2.8k
Python
avatar

InkyPi

E-Ink Display with a Raspberry Pi and a Web Interface to customize and update the display with various plugins

eink epaper inkypi python raspberry-pi
star2.9k
Python
avatar

dify-for-dsl

本项目是基于dify开源项目实现的dsl工作流脚本合集

star2.9k
Python
avatar

Automated-AI-Web-Researcher-Ollama

A python program that turns an LLM, running on Ollama, into an automated researcher, which will with a single query determine focus areas to investigate, do websearches and scrape content from various relevant websites and do research for you all on its own! And more, not limited to but including saving the findings for you!

star2.9k
Python
avatar

Sonic

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

star3.1k
Python
avatar

nunchaku

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

comfyui diffusion-models flux genai iclr
star3.3k
Python
avatar

morphik-core

The most accurate document search and store for building AI apps

artificial-intelligence cache-augmented-generation colpali database litellm
star3.4k
Python
avatar

smollm

Everything about the SmolLM and SmolVLM family of models

star3.4k
Python
avatar

boltz

Official repository for the Boltz biomolecular interaction models

star3.4k
Python
avatar

OML-1.0-Fingerprinting

OML 1.0 via Fingerprinting: Open, Monetizable, and Loyal AI

fine-tuning fingerprint loyalty oml sentient
star3.5k
Python
avatar

ClearerVoice-Studio

ClearerVoice-Studio is an open-source, AI-powered speech processing toolkit designed for researchers, developers, and end-users. It provides capabilities of speech enhancement, speech separation, speech super-resolution, target speaker extraction, and more. The toolkit provides state-of-the-art pre-

audio bandwidth-extension deep-learning noise-suppression pytorch
star3.6k
Python
avatar

MagicQuill

🪶 MagicQuill: An Intelligent Interactive Image Editing System *CVPR 2025* https://github.com/user-attachments/assets/8ee9663a-fef2-484a-a0b7-8427ab590424 There is an HD video on Youtubehttps://www.youtube.com/watch?v=5DiKfONMnE4. Zichen Liuhttps://zliucz.github.io\*,1,2, Yue Yuhttps://bruce

aigc gradio image-editing mllm
star3.6k
Python