Search Results for "sandbox:/mnt/data/project_plan.pod"

Sort By:

Showing 3488 open source projects for "sandbox:/mnt/data/project_plan.pod"

View related business solutions

Python Clear Filters & Widen Search

The full-stack observability platform that protects your dataLayer, tags and conversion data
Stop losing revenue to bad data today. and protect your marketing data with Code-Cube.io.

Code-Cube.io detects issues instantly, alerts you in real time and helps you resolve them fast. No manual QA. No unreliable data. Just data you can trust and act on.

Learn More
The AI-powered unified PSA-RMM platform for modern MSPs.
Trusted PSA-RMM partner of MSPs worldwide

SuperOps.ai is the only PSA-RMM platform powered by intelligent automation and thoughtfully crafted for the new-age MSP. The platform also helps MSPs manage their projects, clients, and IT documents from a single place.

Learn More
1

E2B Desktop Sandbox

E2B Desktop Sandbox for LLMs. E2B Sandbox

E2B Desktop is an open-source sandboxed virtual desktop environment designed to enable secure “computer use” by large language models and automated agents. The platform provides isolated virtual machines where applications can be executed safely without affecting the host system. Each sandbox runs independently and can be configured with custom dependencies or tools required by an AI agent or automation workflow. The system allows developers to programmatically create and control these virtual desktops through SDKs available in languages such as Python and JavaScript. Within a sandbox, developers can launch applications like browsers, editors, or other software that an AI agent may need to interact with. ...

Downloads: 1 This Week

Last Update: 2026-03-09
See Project
2

Polymarket Data

Polymarket Data Retriever that fetches, processes, and structures data

Polymarket Data is a comprehensive data engineering pipeline designed to collect, process, and structure trading activity from the Polymarket prediction market ecosystem into analyzable datasets. The system operates as a multi-stage pipeline that integrates data from both off-chain APIs and on-chain event sources, enabling users to reconstruct full trading activity including markets, order events, and executed trades.

Downloads: 9 This Week

Last Update: 4 days ago
See Project
3

Orange Data Mining

Orange: Interactive data analysis

Open source machine learning and data visualization. Build data analysis workflows visually, with a large, diverse toolbox. Perform simple data analysis with clever data visualization. Explore statistical distributions, box plots and scatter plots, or dive deeper with decision trees, hierarchical clustering, heatmaps, MDS and linear projections. Even your multidimensional data can become sensible in 2D, especially with clever attribute ranking and selections. ...

Downloads: 70 This Week

Last Update: 2025-12-20
See Project
4

Data-Juicer

Data processing for and with foundation models

Data-Juicer is an open-source data processing and augmentation framework designed to enhance the quality and diversity of datasets for machine learning tasks. It includes a modular pipeline for scalable data transformation.

Downloads: 1 This Week

Last Update: 2026-03-17
See Project
Comet Backup - Fast, Secure Backup Software for MSPs
Fast, Secure Backup Software for Businesses and IT Providers

Comet is a flexible backup platform, giving you total control over your backup environment and storage destinations.

Learn More
5

OpenSandbox

OpenSandbox is a general-purpose sandbox platform for AI applications

OpenSandbox is a general purpose sandbox platform designed to securely run and isolate AI applications and untrusted workloads in controlled environments. The project focuses on providing a unified sandbox API that simplifies the process of executing code safely across different runtime backends. It supports multiple programming languages through SDKs, allowing developers to integrate sandbox capabilities into their systems without building custom isolation layers. ...

Downloads: 3 This Week

Last Update: 22 hours ago
See Project
6

AWS Data Wrangler

Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.

An AWS Professional Service open-source python initiative that extends the power of Pandas library to AWS connecting DataFrames and AWS data-related services. Easy integration with Athena, Glue, Redshift, Timestream, OpenSearch, Neptune, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON, and EXCEL). Built on top of other open-source projects like Pandas, Apache Arrow and Boto3, it offers abstracted functions to execute usual ETL tasks like load/unload data from Data Lakes, Data Warehouses, and Databases. ...

Downloads: 14 This Week

Last Update: 2026-04-08
See Project
7

Synthetic Data Generator

SDG is a specialized framework

...It also includes a data processing module capable of handling different data types, preprocessing columns, managing missing values, and converting formats automatically before model training.

Downloads: 11 This Week

Last Update: 2026-03-06
See Project
8

Pythonic Data Structures and Algorithms

Minimal examples of data structures and algorithms in Python

The Pythonic Data Structures and Algorithms repository by keon is a hands-on collection of implementations of classical data structures and algorithms written in Python. It offers working, often well-commented code for many standard algorithmic problems — from sorting/searching to graph algorithms, dynamic programming, data structures, and more — making it a valuable resource for learning and reference.

Downloads: 3 This Week

Last Update: 2026-02-18
See Project
9

Data Version Control

Git-based data version control for machine learning workflows

DVC (Data Version Control) is an open source tool designed to bring version control principles to machine learning and data science workflows. It enables developers and data scientists to track datasets, machine learning models, and experiment results in a way that integrates with existing Git repositories. Instead of storing large datasets directly in Git, DVC keeps lightweight metadata in the repository while storing the actual data in external storage systems. ...

Downloads: 7 This Week

Last Update: 2026-03-31
See Project
Get full visibility and control over your tasks and projects with Wrike.
A cloud-based collaboration, work management, and project management software

Wrike offers world-class features that empower cross-functional, distributed, or growing teams take their projects from the initial request stage all the way to tracking work progress and reporting results.

Learn More
10

Cookiecutter Data Science

Project structure for doing and sharing data science work

A logical, reasonably standardized, but flexible project structure for doing and sharing data science work. When we think about data analysis, we often think just about the resulting reports, insights, or visualizations. While these end products are generally the main event, it's easy to focus on making the products look nice and ignore the quality of the code that generates them. Because these end products are created programmatically, code quality is still important! ...

Downloads: 6 This Week

Last Update: 2025-07-24
See Project
11

HolmesGPT

CNCF Sandbox Project

...The system aggregates signals from observability tools such as logs, metrics, alerts, and distributed traces, then analyzes them using large language models to identify potential root causes. Rather than requiring engineers to manually correlate large volumes of monitoring data, HolmesGPT automatically synthesizes evidence and presents explanations in natural language. The project is developed by Robusta and has been accepted as a Cloud Native Computing Foundation Sandbox project, highlighting its relevance to the cloud-native ecosystem. It is designed to operate as an automated troubleshooting assistant that can analyze incidents continuously and support on-call engineers during outages.

Downloads: 18 This Week

Last Update: 2026-04-09
See Project
12

The Data Engineering Handbook

Links to everything you'd ever want to learn about data engineering

The Data Engineering Handbook is a comprehensive, community-curated repository that aggregates essential learning resources for anyone interested in becoming a professional data engineer. Rather than being a code project itself, it’s a learning handbook that links to books, articles, tutorials, community groups, boot camps, and real-world project examples that collectively form a roadmap to mastering data engineering skills.

Downloads: 3 This Week

Last Update: 2026-04-02
See Project
13

Agentic Data Scientist

An end-to-end Data Scientist

Agentic Data Scientist is an experimental AI-driven research framework that orchestrates data science workflows through autonomous agents that can reason, plan, and execute complex analytics tasks. Unlike traditional scripted pipelines, this project lets AI agents break down high-level research goals into sub-tasks such as data acquisition, cleaning, modeling, evaluation, and reporting, with minimal human direction.

Downloads: 1 This Week

Last Update: 2026-02-05
See Project
14

Data Science Articles from CodeCut

Collection of useful data science topics along with articles

The Data-science repository from CodeCutTech is a curated collection of educational content focused on practical tools and workflows used in modern data science projects. Instead of providing a single software package, the repository aggregates articles, tutorials, and examples covering many topics within the data science ecosystem. The materials address areas such as MLOps, data management, project organization, testing practices, visualization techniques, and productivity tools used by data scientists. ...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
15

Data Science Interviews

Data science interview questions and answers

Data Science Interviews is an open-source repository that collects common data science interview questions along with community-provided answers and explanations. The project serves as a preparation resource for students, job seekers, and professionals who want to review the technical knowledge required for data science roles. The repository organizes questions into different categories including theoretical machine learning concepts, technical programming questions, and probability or statistics problems. ...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
16

Book2_Beauty-of-Data-Visualization

Machine Learning, Criticism and Correction

Book2_Beauty-of-Data-Visualization is an open educational project that teaches the principles and techniques of effective data visualization using Python and modern plotting libraries. The repository focuses on both the technical and aesthetic aspects of visual analytics, helping learners understand how to communicate data clearly and persuasively. It includes practical examples that demonstrate how different chart types reveal patterns, trends, and distributions in real datasets. ...

Downloads: 0 This Week

Last Update: 2026-02-24
See Project
17

Dash Data Agent

Self-learning data agent that grounds its answers in layers of content

Dash is a self-learning data agent built by the Agno AI community that generates grounded answers to English queries over structured data by synthesizing SQL and reasoning based on six layers of context, improving automatically with each run. It sidesteps common limitations of simple text-to-SQL agents by incorporating multiple context layers — including schema structure, human annotations, known query patterns, institutional knowledge from docs, machine-discovered error patterns, and live runtime context — to generate SQL queries that are both technically correct and semantically meaningful. ...

Downloads: 0 This Week

Last Update: 2026-04-08
See Project
18

Synthetic Data Kit

Tool for generating high quality Synthetic datasets

Synthetic Data Kit is a CLI-centric toolkit for generating high-quality synthetic datasets to fine-tune Llama models, with an emphasis on producing reasoning traces and QA pairs that line up with modern instruction-tuning formats. It ships an opinionated, modular workflow that covers ingesting heterogeneous sources (documents, transcripts), prompting models to create labeled examples, and exporting to fine-tuning schemas with minimal glue code.

Downloads: 0 This Week

Last Update: 2025-10-25
See Project
19

Microsandbox

Secure local-first microVM sandbox for running untrusted code fast

Microsandbox is an open source platform designed to securely execute untrusted code in isolated environments using lightweight virtualization techniques. It focuses on combining strong security guarantees with fast startup times by leveraging hardware-level microVM isolation instead of relying solely on traditional containers or full virtual machines. It aims to solve the common tradeoffs between speed, isolation, and control that developers encounter when running untrusted workloads. It...

Downloads: 3 This Week

Last Update: 3 hours ago
See Project
20

cracking-the-data-science-interview

A Collection of Cheatsheets, Books, Questions, and Portfolio

Cracking the Data Science Interview is an open educational repository that collects study materials, resources, and reference links for preparing for data science interviews. The project organizes content across many fundamental areas of data science, including statistics, probability, SQL, machine learning, and deep learning. It includes cheat sheets that summarize important technical concepts commonly discussed during technical interviews.

Downloads: 3 This Week

Last Update: 2026-03-11
See Project
21

Synthetic Data Vault (SDV)

Synthetic Data Generation for tabular, relational and time series data

The Synthetic Data Vault (SDV) is a Synthetic Data Generation ecosystem of libraries that allows users to easily learn single-table, multi-table and timeseries datasets to later on generate new Synthetic Data that has the same format and statistical properties as the original dataset. Synthetic data can then be used to supplement, augment and in some cases replace real data when training Machine Learning models.

Downloads: 3 This Week

Last Update: 2026-04-10
See Project
22

E2B Cookbook

Examples of using E2B

...The repository acts as a practical learning resource for developers who want to integrate AI agents with secure cloud execution environments that allow large language models to run code and interact with tools. The examples illustrate how developers can build AI workflows capable of performing tasks such as data analysis, code execution, and application generation inside isolated sandbox environments. E2B itself provides secure Linux-based sandboxes that enable AI systems to safely run generated code and interact with real computing resources without compromising the host environment. The cookbook organizes examples across multiple frameworks and model providers, allowing developers to experiment with integrations involving models from OpenAI, Anthropic, and other ecosystems.

Downloads: 0 This Week

Last Update: 2026-03-26
See Project
23

AI Data Science Team

An AI-powered data science team of agents

AI Data Science Team is a Python library and agent ecosystem designed to accelerate and automate common data science workflows by modeling them as specialized AI “agents” that can be orchestrated to perform tasks like data cleaning, transformation, analysis, visualization, and machine learning. It provides a modular agent framework where each agent focuses on a step in the typical data science pipeline — for example, loading data from CSV/Excel files, cleaning and wrangling messy datasets, engineering predictive features, building models with AutoML, connecting to SQL databases, and producing visual outputs — all driven by natural language or programmatic instructions. ...

Downloads: 2 This Week

Last Update: 2026-01-26
See Project
24

AutoResearchClaw

Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞

...The system retrieves real academic references from sources such as arXiv and Semantic Scholar to ensure credible citations. It can automatically generate code for experiments, run them in a sandbox environment, and analyze the results with statistical methods. The platform also uses multi-agent debate and automated peer review processes to refine research findings and improve paper quality. By combining literature discovery, experimentation, and writing automation, AutoResearchClaw aims to turn research ideas into conference-ready papers with minimal human intervention.

Downloads: 32 This Week

Last Update: 2026-04-01
See Project
25

Yahoo! Finance market data downloader

Yahoo! Finance market data downloader

Ever since Yahoo! finance decommissioned their historical data API, many programs that relied on it to stop working. yfinance aims to solve this problem by offering a reliable, threaded, and Pythonic way to download historical market data from Yahoo! finance. yfinance aimed to offer a temporary fix to the problem by scraping the data from Yahoo! Finance and returning a the data in the same format as pandas_datareader's get_data_yahoo(), thus keeping the code changes in existing software to a minimum. ...

Downloads: 10 This Week

Last Update: 1 day ago
See Project