Showing 383 open source projects for "git:/git.code.sf.net/p/docfetcher/code"

  • 1
    Pattern Recognition and Machine Learning

    Repository of notes, code and notebooks in Python

    ...The project recreates many of the mathematical concepts and diagrams from the book using executable Jupyter notebooks, allowing readers to experiment directly with the algorithms described in the text. Each section of the repository corresponds to chapters in the book and includes code examples that demonstrate statistical modeling, machine learning methods, and Bayesian inference techniques. These notebooks provide visualizations and computational demonstrations that help clarify complex topics such as probabilistic models, neural networks, kernel methods, and graphical models. The repository also includes implementations of sampling methods, clustering algorithms, and dimensionality reduction techniques used throughout machine learning research.
    Downloads: 0 This Week
  • 2
    PyTorch Transfer-Learning-Library

    Transfer Learning Library for Domain Adaptation, Task Adaptation, etc.

    TLlib is an open-source and well-documented library for Transfer Learning. It is based on pure PyTorch with high performance and friendly API. Our code is pythonic, and the design is consistent with torchvision. You can easily develop new algorithms or readily apply existing algorithms. We appreciate all contributions. If you are planning to contribute back bug-fixes, please do so without any further discussion. If you plan to contribute new features, utility functions or extensions, please first open an issue and discuss the feature with us.
    Downloads: 0 This Week
  • 3
    ISLR-python

    An Introduction to Statistical Learning

    ...The datasets used in the book are also included so that users can run experiments directly within the provided notebooks. By translating the statistical learning material into Python code, the repository makes the book’s concepts accessible to a wider community of Python users.
    Downloads: 1 This Week
  • 4
    Alphafold2

    Unofficial PyTorch implementation / replication of Alphafold2

    ...Once this is replicated, I intend to fold all available amino acid sequences out there in-silico and release it as an academic torrent, to further science. DeepMind has open sourced the official code in JAX, along with the weights! This repository will now be geared towards a straight PyTorch translation with some improvements on positional encoding. lhatsk has reported training a modified trunk of this repository, using the same setup as trRosetta, with competitive results. The underlying assumption is that the trunk works on the residue level, and then constitutes to atomic level for the structure module, whether it be SE3 Transformers, E(n)-Transformer, or EGNN doing the refinement.
    Downloads: 2 This Week
  • 5
    nlpaug

    Data augmentation for NLP

    This Python library helps you augment NLP data for your machine learning projects. Visit this introduction to understand Data Augmentation in NLP. Augmenter is the basic element of augmentation, while Flow is a pipeline that orchestrates multiple augmenters together.
    Downloads: 0 This Week
  • 6
    Machine Learning Glossary

    Machine learning glossary

    ...The content is organized into sections that progressively introduce key ideas from basic machine learning concepts to more advanced mathematical topics. Many pages include diagrams or code examples to illustrate how algorithms work in practice. Because the project emphasizes accessibility, it is particularly useful for beginners who want a conceptual overview of machine learning terminology before diving into more technical research papers.
    Downloads: 0 This Week
  • 7
    Spheroid_segmentation

    Deep learning networks for spheroid segmentation

    To accelerate the analysis of tumor spheroids, different deep learning networks were trained to automate the segmentation process. The code provides trained networks based on VGG16, VGG19, ResNet18, and ResNet50, ready to be used for segmentation. It also provides MATLAB functions for training new networks, segmenting new images, and measuring the quality of the training using different quantitative parameters.
    Downloads: 0 This Week
  • 8
    pyprobml

    Python code for the "Probabilistic Machine Learning" book by Kevin Murphy

    ...See also probml-utils for some utility code that is shared across multiple notebooks.
    Downloads: 0 This Week
  • 9
    Catalyst

    Accelerated deep learning R&D

    Catalyst is a PyTorch framework for accelerated Deep Learning research and development. It allows you to write compact but full-featured Deep Learning pipelines with just a few lines of code. With Catalyst you get a full set of features including a training loop with metrics, model checkpointing and more, all without the boilerplate. Catalyst is focused on reproducibility, rapid experimentation, and codebase reuse so you can break the cycle of writing another regular train loop and make something totally new. Catalyst is compatible with Python 3.6+. ...
    Downloads: 1 This Week
  • 10
    TensorFlowOnSpark

    TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters

    By combining salient features from the TensorFlow deep learning framework with Apache Spark and Apache Hadoop, TensorFlowOnSpark enables distributed deep learning on a cluster of GPU and CPU servers. It enables both distributed TensorFlow training and inferencing on Spark clusters, with a goal to minimize the amount of code changes required to run existing TensorFlow programs on a shared grid.
    Downloads: 0 This Week
  • 11
    DeeProtGO

    DeeProtGO is a deep learning model for predicting GO terms of proteins

    This project contains the source code of DeeProtGO as well as an example of its use when predicting GO terms of the biological process sub-ontology for eukaryotic proteins.
    Downloads: 0 This Week
  • 12
    LayoutParser

    A Unified Toolkit for Deep Learning Based Document Image Analysis

    With the help of state-of-the-art deep learning models, Layout Parser enables extracting complicated document structures using only several lines of code. This method is also more robust and generalizable as no sophisticated rules are involved in this process. A complete instruction for installing the main Layout Parser library and auxiliary components. Learn how to load DL Layout models and use them for layout detection. The full list of layout models currently available in Layout Parser. ...
    Downloads: 3 This Week
  • 13
    The fastai book

    The fastai book, published as Jupyter Notebooks

    ...These notebooks are used for a MOOC and form the basis of this book, which is currently available for purchase. It does not have the same GPL restrictions that are on this repository. The code in the notebooks and python .py files is covered by the GPL v3 license; see the LICENSE file for details. The remainder (including all markdown cells in the notebooks and other prose) is not licensed for any redistribution or change of format or medium, other than making copies of the notebooks or forking this repo for your own private use. ...
    Downloads: 0 This Week
  • 14
    PyTorch Handbook

    The PyTorch handbook is an open source book

    ...The material is written with a practical focus so that readers can follow along and run the provided examples successfully. Each tutorial is tested to ensure that the code runs correctly, making the repository particularly useful for beginners who want reliable learning materials. The handbook emphasizes hands-on learning through real code examples rather than purely theoretical explanations.
    Downloads: 0 This Week
  • 15
    EasyNLP

    EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

    ...It is seamlessly integrated with Platform of AI (PAI) products, including PAI-DSW for development, PAI-DLC for cloud-native training, PAI-EAS for serving, and PAI-Designer for zero-code model training.
    Downloads: 0 This Week
  • 16
    TensorFlow Backend for ONNX

    TensorFlow Backend for ONNX

    Open Neural Network Exchange (ONNX) is an open standard format for representing machine learning models. ONNX is supported by a community of partners who have implemented it in many frameworks and tools. TensorFlow Backend for ONNX makes it possible to use ONNX models as input for TensorFlow. The ONNX model is first converted to a TensorFlow model and then delegated for execution on TensorFlow to produce the output. This is one of the two TensorFlow converter projects which serve different...
    Downloads: 0 This Week
  • 17
    Deep Learning course

    Slides and Jupyter notebooks for the Deep Learning lectures

    Slides and Jupyter notebooks for the Deep Learning lectures at Master Year 2 Data Science from Institut Polytechnique de Paris. This course is taught as part of the Master Year 2 Data Science program at IP-Paris. Note: press "P" to display the presenter's notes, which include some comments and additional references. This lecture is built and maintained by Olivier Grisel and Charles Ollion.
    Downloads: 0 This Week
  • 18
    YOLOv3

    Object detection architectures and models pretrained on the COCO data

    ...Export and deploy your YOLOv5 model with just 1 line of code. There are also loads of quickstart guides and tutorials available to get your model where it needs to be. Create state of the art deep learning models with YOLOv5
    Downloads: 73 This Week
  • 19
    PandaOCR

    Multifunctional OCR Image and Text Recognition

    At present, the newly refactored PandaOCR.Pro professional version has been released. It is faster and more stable, with richer interfaces and easier operation. It is recommended for you to use it! The normal version will continue to be maintained, and all interfaces will be retained but no new functions will be added. The reason why the version number of the professional version starts from 5.x is that the normal version will be updated in the future, so a period of version number is...
    Downloads: 1 This Week
  • 20
    Machine Learning PyTorch Scikit-Learn

    Code Repository for Machine Learning with PyTorch and Scikit-Learn

    Initially, this project started as the 4th edition of Python Machine Learning. However, after putting so much passion and hard work into the changes and new topics, we thought it deserved a new title. So, what’s new? There are many contents and additions, including the switch from TensorFlow to PyTorch, new chapters on graph neural networks and transformers, a new section on gradient boosting, and many more that I will detail in a separate blog post. For those who are interested in knowing...
    Downloads: 6 This Week
  • 21
    CodeSearchNet

    Datasets, tools, and benchmarks for representation learning of code

    CodeSearchNet is a large-scale dataset and research benchmark designed to advance the development of systems that retrieve source code using natural language queries. The project was created through collaboration between GitHub and Microsoft Research and aims to support research on semantic code search and program understanding. The dataset contains millions of pairs of source code functions and corresponding documentation comments extracted from open-source repositories. ...
    Downloads: 2 This Week
  • 22
    MACE

    Deep learning inference framework optimized for mobile platforms

    Mobile AI Compute Engine (or MACE for short) is a deep learning inference framework optimized for mobile heterogeneous computing on Android, iOS, Linux and Windows devices. The runtime is optimized with NEON, OpenCL and Hexagon, and the Winograd algorithm is used to speed up convolution operations. Initialization is also optimized to be faster. Chip-dependent power options such as big.LITTLE scheduling and Adreno GPU hints are included as advanced APIs. UI responsiveness guarantee is sometimes...
    Downloads: 0 This Week
  • 23
    Machine Learning in Asset Management

    Machine Learning in Asset Management

    Machine Learning in Asset Management is a research-oriented repository that explores how machine learning techniques can be applied to portfolio management and asset allocation. The project collects educational materials, code implementations, and experiments related to applying artificial intelligence methods in financial markets. It covers topics such as predictive modeling for asset prices, portfolio optimization strategies, and risk management using machine learning algorithms. The repository also includes references to academic research, tutorials, and datasets that help users understand how machine learning can enhance traditional investment strategies. ...
    Downloads: 0 This Week
  • 24
    AI Platform Training and Prediction
    ...The repository covers the full machine learning lifecycle, including data preprocessing, model training, hyperparameter tuning, evaluation, and prediction serving. It also demonstrates how to scale from local training to distributed cloud-based training without major code changes, making it a valuable resource for transitioning workloads to production environments. Although the repository has been archived, it still provides extensive reference implementations and practical examples for learning cloud-based ML workflows.
    Downloads: 0 This Week
  • 25
    igel

    Machine learning tool that allows you to train and test models

    A delightful machine learning tool that allows you to train/fit, test, and use models without writing code. The goal of the project is to provide machine learning for everyone, both technical and non-technical users. I sometimes needed a tool that I could use to quickly create a machine learning prototype, whether to build a proof of concept, create a quick draft model to prove a point, or use auto ML. I often find myself stuck writing boilerplate code and thinking too much about where to start. ...
    Downloads: 3 This Week