Showing 23 open source projects for "sandbox:/mnt/data/project_plan.pod"

  • 1
    data.table

    Extends base R’s data.frame for high-performance data manipulation

    data.table is an R package that extends base R’s data.frame for high-performance data manipulation. It offers concise syntax, fast execution, and memory-efficient operations, and it supports fast file reading and writing, joins, grouping, reshaping, and updates by reference. It is widely used in large-data workflows and production pipelines. Grouping, aggregation, and summarization are highly efficient, and it can handle very large datasets (hundreds of millions to billions of rows) in memory, provided enough memory is available. ...
    Downloads: 0 This Week
  • 2
    NYC Taxi Data

    Import public NYC taxi and for-hire vehicle (Uber, Lyft)

    The nyc-taxi-data repository is a rich dataset and exploratory project around New York City taxi trip records. It collects and preprocesses large-scale trip datasets (fares, pickup/dropoff, timestamps, locations, passenger counts) to enable data analysis, modeling, and visualization efforts. The project includes scripts and notebooks for cleaning and filtering the raw data, memory-efficient processing for large CSV/Parquet files, and aggregation workflows (e.g. trips per hour, heatmaps of pickups/dropoffs). ...
    Downloads: 3 This Week
  • 3
    dplyr

    dplyr: A grammar of data manipulation

    dplyr is an R package that provides a consistent and intuitive grammar for data manipulation, enabling users to filter, arrange, summarize, and transform data efficiently. Part of the tidyverse ecosystem, dplyr simplifies complex data operations through a clear and readable syntax, whether working with data frames, tibbles, or databases. It is widely used in data science and statistical analysis workflows.
    Downloads: 4 This Week
  • 4
    RStudio Cheatsheets

    Curated collection of official cheat sheets for data science tools

    ...It covers topics such as data wrangling, data import, modeling, visualization, RStudio IDE shortcuts, Shiny development, and the tidyverse suite (dplyr, ggplot2, tidyr, purrr). These cheat sheets are widely used by R learners, educators, and practitioners as quick reference tools, and they often ship with RStudio by default or are linked from RStudio’s help/documentation pages.
    Downloads: 3 This Week
  • 5
    Shiny

    Build interactive web apps directly from R with Shiny framework

    Shiny is an R package from RStudio that enables users to build interactive web applications using R without requiring knowledge of JavaScript, HTML, or CSS. It allows statisticians and data scientists to turn their analyses into fully functional web dashboards with reactive elements, data inputs, visualizations, and controls, making data communication more effective and dynamic.
    Downloads: 2 This Week
  • 6
    rollama

    Wrap the Ollama API, which allows you to run different LLMs

    rollama is an R package that provides a convenient interface for interacting with local large language models through the Ollama API, bringing modern AI capabilities into the R ecosystem. It is designed to make LLM usage accessible to data scientists and researchers who work primarily in R, allowing them to generate text, analyze data, and create embeddings without relying on external cloud services. The package emphasizes reproducibility and privacy by enabling local execution of models, which is especially valuable for sensitive or research-oriented workflows. It supports common LLM tasks such as text generation, annotation, and embedding creation, making it useful for tasks like document analysis and data labeling. ...
    Downloads: 0 This Week
  • 7
    plotly

    An interactive graphing library for R

    This part of the book teaches you how to leverage the plotly R package to create a variety of interactive graphics. There are two main ways to create a plotly object: either by transforming a ggplot2 object into a plotly object (via ggplotly()) or by directly initializing a plotly object with plot_ly()/plot_geo()/plot_mapbox(). The two approaches have somewhat complementary strengths and weaknesses, so it can pay off to learn both. Moreover, both approaches are an implementation of...
    Downloads: 0 This Week
  • 8
    purrr

    A functional programming toolkit for R

    purrr enhances R’s functional programming capabilities by providing a consistent set of tools for working with lists and vectors, enabling safer and more expressive iteration compared to base R’s loop functions.
    Downloads: 4 This Week
  • 9
    Awesome Network Analysis

    A curated list of awesome network analysis resources

    ...It covers multiple programming languages and domains like sociology, biology, and computer science. This repository serves as a central reference for researchers, analysts, and developers working with network data.
    Downloads: 3 This Week
  • 10
    R Color Palettes

    Comprehensive list of color palettes available in R

    This repository is a curated collection of color palettes crafted or curated for data visualization in R. The goal is to provide designers, data scientists, and R users with aesthetically pleasing, perceptually consistent color schemes that work well for plots, maps, and graphics. The repo contains static files listing palette definitions (e.g. hex codes, named hues), sample visualizations showing how each palette performs under different contexts (categorical, sequential, diverging), and helper functions/scripts to import or use the palettes in R. ...
    Downloads: 0 This Week
  • 11
    magrittr

    Improve the readability of R code with the pipe

    magrittr introduces the pipe operator (%>%) and related functional utilities into R. The pipe passes the value on its left-hand side as the first argument of the function on its right, which underlies the piped syntax widely adopted in tidyverse workflows. The package also provides helpers such as the compound assignment pipe (%<>%) and the exposition pipe (%$%).
    Downloads: 0 This Week
  • 12
    blogdown

    Create Blogs and Websites with R Markdown

    ...Developed by Yihui Xie and team, it provides functions to initialize sites, write posts, manage themes, and deploy with minimal fuss. It seamlessly blends R code chunks and web content, ideal for data storytellers and technical bloggers.
    Downloads: 0 This Week
  • 13
    Statistical Rethinking 2024

    This course teaches data analysis

    The 2024 repository is the most recent version of the course, reflecting ongoing refinements in pedagogy, statistical modeling techniques, and coding practices. It provides updated notebooks, R scripts, and model examples, some streamlined and restructured compared to previous years. The 2024 repo also highlights the transition toward more robust Stan models and integration with newer Bayesian workflow practices, continuing to emphasize accessibility for learners while modernizing the tools....
    Downloads: 0 This Week
  • 14
    ggthemes

    Additional themes, scales, and geoms for ggplot2

    ...It is often used to make ggplot2 plots adhere to aesthetic styles from famous news outlets, scientific journals, or presentation decks. It also provides additional color scales and palettes for discrete and continuous data that match the theme aesthetics, along with extensive documentation and examples for each theme and scale, so users can see how plots look and tweak them.
    Downloads: 0 This Week
  • 15
    Statistical Rethinking 2023

    Statistical Rethinking Course for Jan-Mar 2023

    ...It continues to provide scripts for lectures and tutorials, while integrating refinements to examples, notation, and computational workflows introduced that year. Compared with 2022, some models are rewritten for clarity, and teaching materials reflect refinements in McElreath’s evolving presentation of Bayesian data analysis. Students following the 2023 lecture videos use this repository as their coding reference. There are 10 weeks of instruction. Links to lecture recordings will appear in this table. Weekly problem sets are assigned on Fridays and due the next Friday, when we discuss the solutions in the weekly online meeting.
    Downloads: 0 This Week
  • 16
    AI-Agent-Host

    The AI Agent Host is a module-based development environment.

    ...Being data-aware involves connecting a language model to other sources of data, enabling a comprehensive understanding and analysis of information.
    Downloads: 0 This Week
  • 17
    Statistical Rethinking 2022

    Statistical Rethinking course winter 2022

    This repository hosts the 2022 version of the Statistical Rethinking course. It contains course materials such as R scripts, notebooks, and worked examples aligned with McElreath’s textbook. The code emphasizes Bayesian data analysis using R, the rethinking package, and Stan models. It includes lecture code files, example datasets, and structured exercises that parallel the topics covered in the lectures (probability, regression, model comparison, Bayesian updating). The repo functions as a direct hands-on reference for students following the 2022 recorded lecture series. ...
    Downloads: 0 This Week
  • 18
    Reproducible-research

    A Reproducible Data Analysis Workflow with R Markdown, Git, Make, etc.

    In this tutorial, we describe a workflow to ensure long-term reproducibility of R-based data analyses. The workflow leverages established tools and practices from software engineering. It combines the benefits of various open-source software tools including R Markdown, Git, Make, and Docker, whose interplay ensures seamless integration of version management, dynamic report generation conforming to various journal styles, and full cross-platform and long-term computational reproducibility. ...
    Downloads: 0 This Week
  • 19
    benchm-ml

    A benchmark of commonly used open source implementations

    This repository is designed to provide a minimal benchmark framework comparing commonly used machine learning libraries in terms of scalability, speed, and classification accuracy. The focus is on binary classification tasks without missing data, where inputs can be numeric or categorical (after one-hot encoding). It targets large scale settings by varying the number of observations (n) up to millions and the number of features (after expansion) to about a thousand, to stress test different implementations. The benchmarks cover algorithms like logistic regression, random forest, gradient boosting, and deep neural networks, and they compare across toolkits such as scikit-learn, R packages, xgboost, H2O, Spark MLlib, etc. ...
    Downloads: 0 This Week
  • 20
    bbplot

    R package that helps create and export ggplot2 charts

    ...It offers templates and defaults that reduce styling overhead so users can focus on data and storytelling rather than aesthetic minutiae. Because visual consistency is important in media, bbplot helps non-designers build plots that align with professional publication standards. The repository includes documentation, vignettes, example plots, and guidelines for customization (e.g. switching colors, modifying typography).
    Downloads: 0 This Week
  • 21
    DataScienceR

    A curated list of R tutorials for Data Science, NLP

    The DataScienceR repository is a curated collection of tutorials, sample code, and project templates for learning data science using the R programming language. It includes an assortment of exercises, sample datasets, and instructional code that cover the core steps of a data science project: data ingestion, cleaning, exploratory analysis, modeling, evaluation, and visualization. Many of the modules demonstrate best practices in R, such as using the tidyverse, R Markdown, modular scripting, and reproducible workflows. ...
    Downloads: 0 This Week
  • 22
    Investing

    Investing Returns on the Market as a Whole

    This repository, owned by the user zonination (Zoni Nation), presents a data visualization and analysis project on long-term returns from broad stock market indexes, especially the S&P 500. The author gathers historical price data (adjusted for inflation and dividends) and computes growth trajectories under a “buy and hold” strategy over decades. The key insight illustrated is that over sufficiently long holding periods (e.g. 40 years), the stock market stabilizes and nearly always yields positive returns, even accounting for extreme market crashes and recessions. ...
    Downloads: 0 This Week
  • 23
    RStan

    RStan, the R interface to Stan

    RStan is the R interface to Stan, a C++ library for statistical modeling and high-performance statistical computation. It lets users specify models in the Stan modeling language (for Bayesian inference), compile them, and perform inference from R. Key inference approaches include full Bayesian inference via Hamiltonian Monte Carlo (specifically the No-U-Turn Sampler, NUTS), approximate Bayesian inference via variational methods, and optimization (penalized likelihood). RStan integrates with...
    Downloads: 0 This Week