Search Results for "git:/git.code.sf.net/p/docfetcher/code" - Page 5

Showing 128 open source projects for "git:/git.code.sf.net/p/docfetcher/code"

1

PRM800K

800,000 step-level correctness labels on LLM solutions to MATH problems

...The repository releases the raw labels and the labeler instructions used in two project phases, enabling researchers to study how human raters graded intermediate reasoning. Data are stored as newline-delimited JSONL files tracked with Git LFS, where each line is a full solution sample that can contain many step-level labels and rich metadata such as labeler UUIDs, timestamps, generation identifiers, and quality-control flags. Each labeled step can include multiple candidate completions with ratings of -1, 0, or +1, optional human-written corrections (phase 1), and a chosen completion index, along with a final finish reason such as found_error, solution, bad_problem, or give_up.

Downloads: 0 This Week

Last Update: 2 days ago
See Project
2

DiT (Diffusion Transformers)

Official PyTorch Implementation of "Scalable Diffusion Models"

DiT (Diffusion Transformer) is a powerful architecture that applies transformer-based modeling directly to diffusion generative processes for high-quality image synthesis. Unlike CNN-based diffusion models, DiT represents the diffusion process in the latent space and processes image tokens through transformer blocks with learned positional encodings, offering scalability and superior sample quality. The model architecture parallels large language models but for image tokens—each block...

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
3

LLaMA.go

llama.go is like llama.cpp in pure Golang

llama.go is like llama.cpp in pure Golang. The code of the project is based on the legendary ggml.cpp framework of Georgi Gerganov written in C++ with the same attitude to performance and elegance. Both models store FP32 weights, so you'll need at least 32 GB of RAM (not VRAM or GPU RAM) for LLaMA-7B. Double to 64 GB for LLaMA-13B.

Downloads: 0 This Week

Last Update: 2023-08-25
See Project
4

VALL-E

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)

We introduce a language modeling approach for text to speech synthesis (TTS). Specifically, we train a neural codec language model (called VALL-E) using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather than continuous signal regression as in previous work. During the pre-training stage, we scale up the TTS training data to 60K hours of English speech which is hundreds of times larger than existing systems....

Downloads: 2 This Week

Last Update: 2023-04-14
See Project
5

AICommand

ChatGPT integration with Unity Editor

AICommand is a proof-of-concept integration that lets you control the Unity Editor using natural language via ChatGPT. Instead of manually hunting through menus or writing editor scripts, you can prompt the editor to perform tasks, generate snippets, and automate actions. The project showcases an emerging workflow where LLMs augment game and tooling development by understanding intent and producing editor-side outcomes. It provides a minimal setup that connects your OpenAI API key and...

Downloads: 0 This Week

Last Update: 2025-10-28
See Project
6

ConvNeXt V2

Code release for ConvNeXt V2 model

...The result is a convnet that competes strongly with transformer architectures on recognition benchmarks while being efficient and hardware-friendly. The repository provides official PyTorch implementations for multiple model sizes (Atto, Femto, Pico, up through Huge), conversion from JAX weights, code for pretraining/fine-tuning, and pretrained checkpoints. It supports both self-supervised pretraining and supervised fine-tuning.

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
7

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT

minGPT is a minimalist, educational re-implementation of the GPT (Generative Pretrained Transformer) architecture built in PyTorch, designed by Andrej Karpathy to expose the core structure of a transformer-based language model in as few lines of code as possible. It strips away extraneous bells and whistles, aiming to show how a sequence of token indices is fed into a stack of transformer blocks and then decoded into the next token probabilities, with both training and inference supported. Because the whole model is around 300 lines of code, users can follow each step—from embedding lookup, positional encodings, multi-head attention, feed-forward layers, to output heads—and thus demystify how GPT-style models work beneath the surface. ...

Downloads: 0 This Week

Last Update: 2025-10-15
See Project
8

Video Pre-Training

Learning to Act by Watching Unlabeled Online Videos

The Video PreTraining (VPT) repository provides code and model artifacts for a project where agents learn to act by watching human gameplay videos—specifically, gameplay of Minecraft—using behavioral cloning. The idea is to learn general priors of control from large-scale, unlabeled video data, and then optionally fine-tune those priors for more goal-directed behavior via environment interaction.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
9

Mask2Former

Code release for "Masked-attention Mask Transformer

Mask2Former is a unified segmentation architecture that handles semantic, instance, and panoptic segmentation with one model and one training recipe. Its core idea is to cast segmentation as mask classification: a transformer decoder predicts a set of mask queries, each with an associated class score, eliminating the need for task-specific heads. A pixel decoder fuses multi-scale features and feeds masked attention in the transformer so each query focuses computation on its current spatial...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
10

GLIDE (Text2Im)

GLIDE: a diffusion-based text-conditional image synthesis model

...It demonstrates how diffusion-based generative models can be conditioned on text to produce highly detailed and coherent visual outputs. The repository provides both model code and pretrained checkpoints, making it possible for researchers and developers to experiment with text-to-image synthesis. GLIDE includes advanced techniques such as classifier-free guidance, which improves the quality and alignment of generated images with the input text. The project also offers sampling scripts and utilities for exploring how diffusion models can be applied to multimodal tasks. ...

Downloads: 0 This Week

Last Update: 2 days ago
See Project
11

Image GPT

Large-scale autoregressive pixel model for image generation by OpenAI

Image-GPT is the official research code and models from OpenAI’s paper Generative Pretraining from Pixels. The project adapts GPT-2 to the image domain, showing that the same transformer architecture can model sequences of pixels without altering its fundamental structure. It provides scripts to download pretrained checkpoints of different model sizes (small, medium, large) trained on large-scale datasets and includes utilities for handling color quantization with a 9-bit palette. ...

Downloads: 3 This Week

Last Update: 9 hours ago
See Project
12

MUSE

A library for Multilingual Unsupervised or Supervised word Embeddings

...It supports both supervised alignment with seed dictionaries and unsupervised alignment that starts without parallel data by using adversarial initialization followed by Procrustes refinement. The code can align pre-trained monolingual embeddings (such as fastText) across dozens of languages and provides standardized evaluation scripts and dictionaries. By mapping languages into a common vector space, MUSE makes it straightforward to build cross-lingual applications where resources are scarce for some languages. The training and evaluation pipeline is lightweight and fast, so experimenting with different languages or initialization strategies is easy. ...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
13

Improved GAN

Code for the paper "Improved Techniques for Training GANs"

Improved-GAN is the official code release from OpenAI accompanying the research paper Improved Techniques for Training GANs. It provides implementations of experiments conducted on datasets such as MNIST, SVHN, CIFAR-10, and ImageNet. The project focuses on demonstrating enhanced training methods for Generative Adversarial Networks, addressing stability and performance issues that were common in earlier GAN models.

Downloads: 1 This Week

Last Update: 9 hours ago
See Project
14

InfoGAN

Code for reproducing key results in the paper

...That extra incentive encourages the generator to structure its latent space in a way where certain latent variables control meaningful, distinct factors (e.g. rotation, width, stroke thickness) in the output images. The repository includes code for experiments (e.g. on MNIST), launcher scripts, and some tests. It depends on a development version of TensorFlow (the code expects features not in older stable releases), and also uses other libraries like prettytensor and progressbar.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
15

SG2Im

Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201

...This separation lets the model reason about geometry and composition before committing to texture and color, improving spatial fidelity. The repository includes training code, datasets, and evaluation scripts so researchers can reproduce baselines and extend components such as the graph encoder or image generator. In practice, sg2im demonstrates how structured semantics can guide generative models to produce controllable, compositional imagery.

Downloads: 0 This Week

Last Update: 2025-10-10
See Project
16

Leanstral

Open-source code agent designed for Lean 4

Leanstral is an open-weight large language model developed by Mistral AI and specifically designed as a code agent for the Lean 4 proof assistant, enabling advanced interaction with formal mathematics and program verification systems. The model is built to understand and generate Lean 4 code, which is used to express complex mathematical constructs as well as formal software specifications. By focusing on theorem proving and formal reasoning, Leanstral represents a specialized direction within large language models, targeting domains that require strict correctness and logical rigor rather than general conversational tasks. ...

Downloads: 0 This Week

Last Update: 2026-03-17
See Project
17

Nemotron 3

Large language model developed and released by NVIDIA

...This configuration supports a massive context length of up to 1 million tokens, making it suitable for long-context reasoning, agentic tasks, extended dialogues, and applications like code generation or document summarization.

Downloads: 0 This Week

Last Update: 2026-01-07
See Project
18

Mellum-4b-base

JetBrains’ 4B parameter code model for completions

Mellum-4b-base is JetBrains’ first open-source large language model designed and optimized for code-related tasks. Built with 4 billion parameters and a LLaMA-style architecture, it was trained on over 4.2 trillion tokens across multiple programming languages, including datasets such as The Stack, StarCoder, and CommitPack. With a context window of 8,192 tokens, it excels at code completion, fill-in-the-middle tasks, and intelligent code suggestions for professional developer tools and IDEs. ...

Downloads: 0 This Week

Last Update: 2025-09-11
See Project
19

VaultGemma

VaultGemma: 1B DP-trained Gemma variant for private NLP tasks

VaultGemma is a sub-1B parameter variant of Google’s Gemma family that is pre-trained from scratch with Differential Privacy (DP), providing mathematically backed guarantees that its outputs do not reveal information about any single training example. Using DP-SGD with a privacy budget across a large English-language corpus (web documents, code, mathematics), it prioritizes privacy over raw utility. The model follows a Gemma-2–style architecture, outputs text from up to 1,024 input tokens, and is intended to be instruction-tuned for downstream language understanding and generation tasks. Training ran on TPU v6e using JAX and Pathways with privacy-preserving algorithms (DP-SGD, truncated Poisson subsampling) and DP scaling laws to balance compute and privacy budgets. ...

Downloads: 0 This Week

Last Update: 2025-09-17
See Project
20

gpt-oss-20b

OpenAI’s compact 20B open model for fast, agentic, and local use

...With 21B total parameters and 3.6B active parameters (MoE), it fits within 16GB of memory thanks to native MXFP4 quantization. Designed for high-performance reasoning, it supports Harmony response format, function calling, web browsing, and code execution. Like its larger sibling (gpt-oss-120b), it offers adjustable reasoning depth and full chain-of-thought visibility for better interpretability. It’s released under a permissive Apache 2.0 license, allowing unrestricted commercial and research use. GPT-OSS-20B is compatible with Transformers, vLLM, Ollama, PyTorch, and other tools. ...

Downloads: 0 This Week

Last Update: 2025-08-05
See Project
21

gpt-oss-120b

OpenAI’s open-weight 120B model optimized for reasoning and tooling

...Developers can control the reasoning level (low, medium, high) to balance speed and depth depending on the task. Released under the Apache 2.0 license, it enables both commercial and research applications. The model supports function calling, web browsing, and code execution, streamlining intelligent agent development.

Downloads: 0 This Week

Last Update: 2025-08-05
See Project
22

DeepSeek-V3.1-Terminus

685B model with improved agents and consistency

...It improves language consistency, reducing mixed Chinese-English outputs and eliminating abnormal characters, enhancing reliability in multilingual scenarios. The update also refines agentic capabilities, especially for the Code Agent and Search Agent, leading to better tool integration and query handling. Benchmarks show small but notable gains, such as raising MMLU-Pro from 84.8 to 85.0, GPQA-Diamond from 80.1 to 80.7, and SWE Verified from 66.0 to 68.4, along with significant improvements in agent benchmarks like BrowseComp (30.0 → 38.5) and Terminal-bench (31.3 → 36.7). ...

Downloads: 0 This Week

Last Update: 2025-09-24
See Project
23

BLEURT-20-D12

Custom BLEURT model for evaluating text similarity using PyTorch

...Unlike standard BLEURT models from TensorFlow, this version is built from a custom PyTorch transformer library. It requires installing the model-specific library from GitHub to function properly. Once set up, it can be used to compute similarity scores with minimal code. BLEURT-20-D12 enables more flexible deployment in PyTorch-based workflows for evaluating language generation outputs.

Downloads: 0 This Week

Last Update: 2025-07-02
See Project
24

Hermes 4

Hermes 4 FP8: hybrid reasoning Llama-3.1-405B model by Nous Research

...It introduces a hybrid reasoning mode with explicit <think> segments, enabling the model to deliberate deeply when needed and switch to faster responses when desired. Post-training improvements include a vastly expanded corpus with ~60B tokens, boosting performance across math, code, STEM, logic, creativity, and structured outputs. The model is designed for schema adherence, producing valid JSON and repairing malformed outputs, making it highly suitable for tool use and function calling. Hermes 4 is engineered for superior steerability with reduced refusal rates, aligning responses to user values while preserving assistant quality. ...

Downloads: 0 This Week

Last Update: 2025-09-01
See Project
25

wav2vec2-large-xlsr-53-portuguese

Portuguese ASR model fine-tuned on XLSR-53 for 16kHz audio input

wav2vec2-large-xlsr-53-portuguese is an automatic speech recognition (ASR) model fine-tuned on Portuguese using the Common Voice 6.1 dataset. It is based on Facebook’s wav2vec2-large-xlsr-53, a multilingual self-supervised learning model, and is optimized to transcribe Portuguese speech sampled at 16kHz. The model performs well without a language model, though adding one can improve word error rate (WER) and character error rate (CER). It achieves a WER of 11.3% (or 9.01% with LM) on Common...

Downloads: 0 This Week

Last Update: 2025-07-01
See Project

© 2026 Slashdot Media. All Rights Reserved.