AutoGluon: AutoML for Image, Text, and Tabular Data
Detecting silent model failure. NannyML estimates performance
Data science on data without acquiring a copy
The open-source tool for building high-quality datasets
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Investment Research for Everyone, Everywhere
Python Stream Processing
Evaluate and monitor ML models from validation to production
Effortless data labeling with AI support from Segment Anything
Python library for defining and optimizing mathematical expressions
Helps data scientists define testable self-documenting dataflows
Train machine learning models within Docker containers
The machine learning toolkit for time series analysis in Python
Test Suites for validating ML models & data
Supercharge Your Model Training
Fault-tolerant, highly scalable GPU orchestration
High-level training, data augmentation, and utilities for Pytorch
A curated list of data mining papers about fraud detection
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Making Enterprise Data Intelligent and Responsive for AI
Create HTML profiling reports from pandas DataFrame objects
Foundation Model for Tabular Data
Open source framework for deep learning satellite and aerial imagery
A library of extension and helper modules for Python's data analysis