CodeZen
A GitHub project discovery site focused on the Chinese-speaking community
datasets
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
Torch-Pruning
[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.
md
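✍ WeChat Markdown Editor | A highly streamlined Markdown editor for WeChat: supports Markdown syntax, custom theme styles, content management, multiple image hosts, an AI assistant, and more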
haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
metaflow
Build, Manage and Deploy AI/ML Systems
milvus
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models
trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!