CodeZen

A GitHub project discovery site focused on the Chinese-speaking community

seed-vc

Seed-VC. Real-time demo video (real-time-demo.webm): https://github.com/user-attachments/assets/86325c5e-f7f6-4a04-8695-97275a5d046c The currently released model supports *zero-shot voice conversion* 🔊, *zero-shot real-time voice conversion* 🗣️ and *zero-shot singing voice

singing-voice-conversion voice-conversion
Stars: 3.4k
Python

PlugNPlay-Modules

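The most complete and up-to-date collection of plug-and-play modules on the web. Current progress: 70%. Covers convolution, attention mechanisms, downsampling, feature fusion modules and more, and is updated continuously. For detailed paper walkthroughs, follow the WeChat official account 【ai缝合大王】 and the Bilibili channel 【ai缝合大王】. For module sharing and discussion, join QQ group 994264161. More topic-specific groups: ① object detection ② image classification ③ semantic segmentation ④ face recognition ⑤ 3D reconstruction ⑥ multimodal fusion ⑦ pose estimation ⑧ super-resolution ⑨ autonomous driving ⑩ image generation ⑪ remote sensing imagery ⑫ medical imaging ⑬ low-level vision ⑭ YOLO series ⑮ Mamba and other new architectures ⑯ video processing ⑰ 3D ⑱ large models ⑲ re-identification (ReID) ⑳ image deraining/denoising/deblurring. The topic-specific groups are WeChat groups; scan the QR code to add the WeChat contact and reply 1-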

Stars: 4.8k
Python

minimind-v

🚀 [Large models] Train a 26M-parameter vision-language model (VLM) from scratch in just 1 hour! 🌏

artificial-intelligence chatgpt vision-language-model
Stars: 5.2k
Python

KAG

KAG: Knowledge Augmented Generation. 1. What is KAG? KAG is a logical reasoning and Q&A framework based on the OpenSPG (https://github.com/OpenSPG/openspg) engine and lar

knowledge-graph large-language-model logical-reasoning multi-hop-question-answering trustfulness
Stars: 8.2k
Python

ha_xiaomi_home

Xiaomi Home Integration for Home Assistant. Xiaomi Home Integration is a Home Assistant integration officially supported by Xiaomi. It allows you to use Xiaomi IoT smart devices in Home Assistant. Installation > Home Assistant version require

home-assistant home-assistant-integration miot miot-devices smart-home
Stars: 20.9k
Python

PDFMathTranslate

[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats. AI-based full-text bilingual translation of PDF documents that fully preserves the layout; supports Google/DeepL/Ollama/OpenAI and other services, and provides CLI/GUI/MCP/Docker/Zotero

chinese document edit english japanese
Stars: 29.7k
Python

Show-o

One Single Transformer to Unify Multimodal Understanding and Generation. Jinheng Xie (https://sierkinhane.github.io/) 1*, Weijia Mao (https://scholar.google.com/citations?hl=zh-CN&user=S7bGBmkyNtEC&view_op=list_works&sortby=pubdate) 1*, Zechen Bai (https://www.baizechen.site/) 1*, David

diffusion-models large-language-models multimodal
Stars: 1.8k
Python

LongWriter

LongWriter: Unleashing 10,000+ word generation from long-context LLMs. 🤗 HF repo • 📃 Paper • 🚀 HF Space. Demo video: https://github.com/user-attachments/assets/c7eedeca-98ed-43ec-8619-25137987bcde Left: LongWriter-glm4-9b; right: GLM-4-9B-chat. 🔥 Updates: **August 18, 2024** You can now use vllm https://github.com/vllm-

fine-tuning llm long-context long-text
Stars: 1.8k
Python

ProxyCat

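A tunnel proxy pool middleware that can be deployed in the cloud or locally. It flexibly turns static proxy IPs into tunnel IPs and exposes a fixed request address: deploy once, use for life.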

cyber-security cyber-security-tool proxy proxypool security
Stars: 2.3k
Python

VITA

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction. 📖 VITA-1.5 Paper: https://arxiv.org/pdf/2501.01957 🤖 Basic Demo: https://modelscope.cn/studios/modelscope/VITA1.5_demo 🍎 VITA-1.0: https://vita-home.github.io/ 💬 WeChat: ./asset/wechat-group.jpg 📽 VITA-1.5 De

large-multimodal-models multimodal-large-language-models omni-language-model omni-modal-video-understanding omni-model
Stars: 2.4k
Python

llm_related

Reproductions of algorithms related to large models, plus some study notes

Stars: 2.5k
Python

D-FINE

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement. English blog: src/zoo/dfine/blog.md | Chinese blog: src/zoo/dfine/blog_cn.md

d-fine detr object-detection
Stars: 2.8k
Python