Orange: Interactive data analysis
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Project structure for doing and sharing data science work
An AI-powered data science team of agents
Fast, flexible and powerful Python data analysis toolkit
Data integration platform for ELT pipelines from APIs, databases
Machine learning in Python
CKAN is an open-source DMS (data management system) for powering data hubs
An orchestration platform for the development, production
Efficiently diff rows across two different databases
Light-weight, flexible, expressive statistical data testing library
Uncover insights, surface problems, monitor, and fine-tune your LLM
Training data (data labeling, annotation, workflow) for all data types
WebGL-based viewer for volumetric data
Python data, Leaflet.js maps
Benchmarking synthetic data generation methods
Positron, a next-generation data science IDE
AI-data warehouse to enrich, transform and analyze unstructured data
matplotlib: plotting with Python
Synthetic data generators for structured and unstructured text
Spatial data processing for geomodeling
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Always know what to expect from your data
Visualize and compare datasets, target values and associations
Detecting silent model failure. NannyML estimates performance