CodeZen

A GitHub project discovery site focused on the Chinese-language community

datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

ai artificial-intelligence computer-vision dataset-hub datasets
Stars: 20.8k
Python

Torch-Pruning

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

efficient-deep-learning llm model-compression pruning transformers
Stars: 3.2k
Python

md

✍ WeChat Markdown Editor | A highly minimalist WeChat Markdown editor: supports Markdown syntax, custom theme styles, content management, multiple image hosts, an AI assistant, and more

ai-bot doocs editor llm markdown
Stars: 10.9k
Vue

haystack

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

agent agents ai gemini generative-ai
Stars: 23.3k
MDX

flyte

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

data data-analysis data-science dataops declarative
Stars: 6.6k
Go

metaflow

Build, Manage and Deploy AI/ML Systems

agents ai aws azure cost-optimization
Stars: 9.6k
Python

milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

anns cloud-native diskann distributed embedding-database
Stars: 38.3k
Go

deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

ai computer-vision cv data-science datalake
Stars: 8.9k
Python

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

beit beit-3 bitnet deepnet document-ai
Stars: 21.8k
Python

awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models

bert chinese dataset ernie gpt
Stars: 5.4k
Python

trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

article-extractor corpus-builder corpus-tools crawler html-to-markdown
Stars: 4.9k
Python

BentoML

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

ai-inference deep-learning generative-ai inference-platform llm
Stars: 8.2k
Python