Machine Learning and AI Engineer open source tools and frameworks

deeplake

⭐ 8896 Python Score 107 Updated 2025-11-05

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time...

ai computer-vision cv data-science datalake datasets

transformers

⭐ 152554 Python Score 97 Updated 2025-11-15

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference...

audio deep-learning deepseek gemma glm hacktoberfest

annotated_deep_learning_paper_implementations

⭐ 64307 Python Score 97 Updated 2025-11-11

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optim...

attention deep-learning deep-learning-tutorial gan literate-programming lora

keras

⭐ 63559 Python Score 97 Updated 2025-11-14

Deep Learning for humans

data-science deep-learning jax machine-learning neural-networks python

yolov5

⭐ 56029 Python Score 97 Updated 2025-11-09

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

coreml deep-learning ios machine-learning ml object-detection

ray

⭐ 39837 Python Score 97 Updated 2025-11-16

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

data-science deep-learning deployment distributed hyperparameter-optimization hyperparameter-search

DocsGPT

⭐ 17367 Python Score 97 Updated 2025-11-14

Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API conn...

agent-builder agents ai chatgpt docsgpt hacktoberfest

txtai

⭐ 11819 Python Score 97 Updated 2025-11-14

💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows

ai artificial-intelligence embeddings information-retrieval language-model large-language-models

ml-engineering

⭐ 15736 Python Score 89 Updated 2025-10-27

Machine Learning Engineering Open Book

ai debugging gpus inference large-language-models llm

wandb

⭐ 10540 Python Score 89 Updated 2025-11-15

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

ai collaboration data-science data-versioning deep-learning experiment-track

LLMs-from-scratch

⭐ 78754 Jupyter Notebook Score 85 Updated 2025-11-13

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

ai artificial-intelligence chatbot chatgpt deep-learning from-scratch

Real-Time-Voice-Cloning

⭐ 58847 Python Score 82 Updated 2025-09-23

Clone a voice in 5 seconds to generate arbitrary speech in real-time

deep-learning python pytorch tensorflow tts voice-cloning

memvid

⭐ 10381 Python Score 82 Updated 2025-10-12

Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.

ai context embedded faiss knowledge-base knowledge-graph

pytorch

⭐ 95090 Python Score 81 Updated 2025-11-16

Tensors and Dynamic neural networks in Python with strong GPU acceleration

autograd deep-learning gpu machine-learning neural-network numpy

faceswap

⭐ 54717 Python Score 81 Updated 2025-11-11

Deepfakes Software For All

deep-face-swap deep-learning deep-neural-networks deepface deepfakes deeplearning

ultralytics

⭐ 48730 Python Score 81 Updated 2025-11-16

Ultralytics YOLO 🚀

cli computer-vision deep-learning hub image-classification instance-segmentation

DeepSpeed

⭐ 40703 Python Score 81 Updated 2025-11-14

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

billion-parameters compression data-parallelism deep-learning gpu inference

stable-diffusion-webui

⭐ 158229 Python Score 81 Updated 2025-11-07

Stable Diffusion web UI

ai ai-art deep-learning diffusion gradio image-generation

langchain

⭐ 119742 Python Score 81 Updated 2025-11-16

🦜🔗 The platform for reliable agents.

agents ai ai-agents ai-agents-framework aiagentframework anthropic

vllm

⭐ 63153 Python Score 81 Updated 2025-11-16

A high-throughput and memory-efficient inference and serving engine for LLMs

amd blackwell cuda deepseek deepseek-v3 gpt

LLaMA-Factory

⭐ 62541 Python Score 81 Updated 2025-11-13

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

agent ai deepseek fine-tuning gemma gpt

llama_index

⭐ 45244 Python Score 81 Updated 2025-11-14

LlamaIndex is the leading framework for building LLM-powered agents over your data.

agents application data fine-tuning framework llamaindex

peft

⭐ 20057 Python Score 81 Updated 2025-11-13

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

adapter diffusion fine-tuning llm lora parameter-efficient-learning

RWKV-LM

⭐ 14142 Python Score 81 Updated 2025-11-14

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "...

attention-mechanism chatgpt deep-learning gpt gpt-2 gpt-3

speechbrain

⭐ 10794 Python Score 81 Updated 2025-11-07

A PyTorch-based Speech Toolkit

asr audio audio-processing deep-learning huggingface language-model

tensorflow

⭐ 192439 C++ Score 77 Updated 2025-11-16

An Open Source Machine Learning Framework for Everyone

deep-learning deep-neural-networks distributed machine-learning ml neural-network

haystack

⭐ 23381 MDX Score 77 Updated 2025-11-14

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or...

agent agents ai gemini generative-ai gpt-4

metaflow

⭐ 9627 Python Score 75 Updated 2025-11-15

Build, Manage and Deploy AI/ML Systems

agents ai aws azure cost-optimization datascience

BentoML

⭐ 8204 Python Score 75 Updated 2025-11-15

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

ai-inference deep-learning generative-ai inference-platform llm llm-inference

airweave

⭐ 5186 Python Score 75 Updated 2025-11-16

Context retrieval for AI agents across apps and databases

agents knowledge-graph llm llm-agent rag search

LEANN

⭐ 4379 Python Score 75 Updated 2025-11-14

RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

ai faiss gpt-oss langchain llama-index llm

scikit-learn

⭐ 64038 Python Score 73 Updated 2025-11-15

scikit-learn: machine learning in Python

data-analysis data-science machine-learning python statistics

OpenBB

⭐ 54578 Python Score 73 Updated 2025-11-12

Financial data platform for analysts, quants and AI agents.

ai crypto derivatives economics equity finance

streamlit

⭐ 42233 Python Score 73 Updated 2025-11-16

Streamlit — A faster way to build and share data apps.

data-analysis data-science data-visualization deep-learning developer-tools machine-learning

gradio

⭐ 40527 Python Score 73 Updated 2025-11-14

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

data-analysis data-science data-visualization deep-learning deploy gradio

MockingBird

⭐ 36765 Python Score 73 Updated 2025-11-13

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

ai deep-learning pytorch speech text-to-speech tts

mlflow

⭐ 22961 Python Score 73 Updated 2025-11-15

The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and e...

agentops agents ai ai-governance apache-spark evaluation

browser-use

⭐ 72584 Python Score 73 Updated 2025-11-16

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

ai-agents ai-tools browser-automation browser-use llm playwright

OpenHands

⭐ 65002 Python Score 73 Updated 2025-11-16

🙌 OpenHands: Code Less, Make More

agent artificial-intelligence chatgpt claude-ai cli developer-tools

mem0

⭐ 43172 Python Score 73 Updated 2025-11-15

Universal memory layer for AI Agents

agents ai ai-agents application chatbots chatgpt

chatgpt-on-wechat

⭐ 39717 Python Score 73 Updated 2025-10-22

基于大模型搭建的聊天机器人，同时支持微信公众号、企业微信应用、飞书、钉钉等接入，可选择ChatGPT/Claude/DeepSeek/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Kimi/LinkAI，能处理文本、语音和图片，访问操作系统和互联网，支持基于自有知识库进行定制企业智能客服。

ai ai-agent chatgpt claude-4 deepseek dingtalk

Langchain-Chatchat

⭐ 36563 Python Score 73 Updated 2025-11-10

Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local kno...

chatbot chatchat chatglm chatgpt embedding faiss

PaddleNLP

⭐ 12844 Python Score 73 Updated 2025-11-14

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

bert compression distributed-training document-intelligence embedding ernie

segmentation_models.pytorch

⭐ 11068 Python Score 73 Updated 2025-10-29

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

computer-vision deeplab-v3-plus deeplabv3 dpt fpn image-processing

ComfyUI

⭐ 93678 Python Score 73 Updated 2025-11-16

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

ai comfy comfyui python pytorch stable-diffusion

nni

⭐ 14292 Python Score 72 Updated 2024-07-03

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper...

automated-machine-learning automl bayesian-optimization data-science deep-learning deep-neural-network

llm-app

⭐ 46830 Jupyter Notebook Score 69 Updated 2025-10-23

Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3,...

chatbot hugging-face llm llm-local llm-prompting llm-security

ragflow

⭐ 67757 TypeScript Score 69 Updated 2025-11-14

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context...

agent agentic agentic-ai agentic-workflow ai ai-search

tensorzero

⭐ 10554 Rust Score 69 Updated 2025-11-16

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

ai ai-engineering anthropic artificial-intelligence deep-learning genai

anything-llm

⭐ 51094 JavaScript Score 69 Updated 2025-11-07

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

ai-agents custom-ai-agents deepseek kimi llama3 llm

Machine Learning and AI Engineer open source stack