Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
AI Models
Search Results

Search Results for "git:/git.code.sf.net/p/docfetcher/code" - Page 3

x

Sort By:

Relevance

Clear All Filters

OS

Windows 119
Mac 117
Linux 116
More...
BSD 86
ChromeOS 86

Category

Artificial Intelligence 119
Scientific/Engineering 2
Multimedia 1
Security 1

License

OSI-Approved Open Source 85
Creative Commons Attribution License 2
Other License 2

Translations

English 3
Chinese (Simplified) 1
Chinese (Traditional) 1

Programming Language

Python 85
JavaScript 3
TypeScript 3
Unix Shell 3
More...
C++ 2
Go 2
C 1
C# 1
Swift 1

Showing 119 open source projects for "git:/git.code.sf.net/p/docfetcher/code"

View related business solutions

AI Models Windows Clear Filters & Widen Search

MaintainX is the world-leading mobile-first workflow management platform for industrial and frontline workers.
Trusted by Operational Leaders Across the Globe

Your day-to-day maintenance tasks, simplified. MaintainX eliminates the paperwork, so you can spend less time on your clipboard and more time getting things done.

Learn More
The Apple Device Management and Security Platform
For IT teams at organizations that run on Apple

Achieve harmony across your Apple device fleet with Kandji's unmatched management and security capabilities.

Learn More
1

HunyuanImage-3.0

A Powerful Native Multimodal Model for Image Generation

...The model is intended to be competitive with closed-source image generation systems, aiming for high fidelity, prompt adherence, fine detail, and even “world knowledge” reasoning (i.e. leveraging context, semantics, or common sense in generation). The GitHub repo includes code, scripts, model loading instructions, inference utilities, prompt handling, and integration with standard ML tooling (e.g. Hugging Face / Transformers).

1 Review

Downloads: 6 This Week

Last Update: 2026-02-03
See Project
2

Anthropic SDK TypeScript

Access to Anthropic's safety-first language model APIs

anthropic-sdk-typescript is the TypeScript / JavaScript client library for the Anthropic REST API, enabling backend or Node.js usage of models like Claude. It wraps API endpoints for creating messages, streaming responses, and managing parameters in a type-safe TS environment. The library is designed for server-side use, interfacing with REST, and is stable for integration in web services or backend agents. Example usage shows how to instantiate the Anthropic client, call...

Downloads: 4 This Week

Last Update: 4 days ago
See Project
3

Ling

Ling is a MoE LLM provided and open-sourced by InclusionAI

...As more developers and researchers engage with the platform, we can expect rapid advancements and improvements, leading to even more sophisticated applications. Model inference and API code (e.g. integration with Transformers). This collaborative approach accelerates development and ensures that the models remain at the forefront of technology, addressing emerging challenges in various fields.

Downloads: 0 This Week

Last Update: 2025-09-30
See Project
4

FireRedTTS-2

Long-form streaming TTS system for multi-speaker dialogue generation

FireRedTTS2 is a next-generation open-source text-to-speech (TTS) system focused on long-form, streaming speech synthesis for multi-speaker dialogue, delivering stable natural speech with context-aware prosody and reliable speaker transitions that support real-time and conversational applications. It features a specialized streaming speech tokenizer and a dual-transformer architecture that enables low latency and high-quality synthesis, making it suitable for interactive systems like...

Downloads: 3 This Week

Last Update: 2026-02-16
See Project
Fully managed relational database service for MySQL, PostgreSQL, and SQL Server
Focus on your application, and leave the database to us

Cloud SQL manages your databases so you don't have to, so your business can run without disruption. It automates all your backups, replication, patches, encryption, and storage capacity increases to give your applications the reliability, scalability, and security they need.

Try for free
5

ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

ComfyUI-LTXVideo is a bridge between ComfyUI’s node-based generative workflow environment and the LTX-Video multimedia processing framework, enabling creators to orchestrate complex video tasks within a visual graph paradigm. Instead of writing code to apply effects, transitions, edits, and data flows, users can assemble nodes that represent video inputs, transformations, and outputs, letting them prototype and automate video production pipelines visually. This integration empowers non-programmers and rapid-iteration teams to harness the performance of LTX-Video while maintaining the clarity and flexibility of a dataflow graph model. ...

Downloads: 3 This Week

Last Update: 2026-04-13
See Project
6

4M

4M: Massively Multimodal Masked Modeling

...The same model family can classify, segment, detect, caption, and even generate images, with a single interface for both discriminative and generative use. The repository releases code and models for multiple variants (e.g., 4M-7 and 4M-21), emphasizing transfer to unseen tasks and modalities. Training/inference configs and issues discuss things like depth tokenizers, input masks for generation, and CUDA build questions, signaling active research iteration. The design leans into flexibility and steerability, so prompts and masks can shape behavior without bespoke heads per task. ...

Downloads: 0 This Week

Last Update: 2025-10-08
See Project
7

Qwen3-VL

Qwen3-VL, the multimodal large language model series by Alibaba Cloud

Qwen3-VL is the latest multimodal large language model series from Alibaba Cloud’s Qwen team, designed to integrate advanced vision and language understanding. It represents a major upgrade in the Qwen lineup, with stronger text generation, deeper visual reasoning, and expanded multimodal comprehension. The model supports dense and Mixture-of-Experts (MoE) architectures, making it scalable from edge devices to cloud deployments, and is available in both instruction-tuned and...

Downloads: 4 This Week

Last Update: 3 days ago
See Project
8

Qwen3 Embedding

Designed for text embedding and ranking tasks

...It achieves state-of-the-art performance on benchmarks like MTEB (Multilingual Text Embedding Benchmark) and supports instruction-aware embedding (i.e. embedding task instructions along with queries) and flexible embedding/vector dimension definitions. It is meant for tasks such as text retrieval, classification, clustering, bitext mining, and code retrieval.

Downloads: 0 This Week

Last Update: 2025-09-30
See Project
9

DFlash

Block Diffusion for Ultra-Fast Speculative Decoding

...This approach has been shown to deliver lossless acceleration on models like Qwen3-8B by combining block diffusion techniques with efficient batching, making it ideal for applications where latency and cost matter. The project includes support for multiple draft models, example integration code, and scripts to benchmark performance, and it is structured to work with popular model serving stacks like SGLang and the Hugging Face Transformers ecosystem.

Downloads: 2 This Week

Last Update: 3 days ago
See Project
Find out just how much your login box can do for your customer | Auth0
With over 53 social login options, you can fast-track the signup and login experience for users.

From improving customer experience through seamless sign-on to making MFA as easy as a click of a button – your login box must find the right balance between user convenience, privacy and security.

Sign up
10

Qwen2-Audio

Repo of Qwen2-Audio chat & pretrained large audio language model

...It is evaluated on many benchmarks (speech recognition, translation, sound classification, emotion, etc.), and offers pretrained models (e.g. 7B) released via ModelScope and Hugging Face. Code & examples provided with Hugging Face transformers, and usage via AutoProcessor, model classes etc. High performance on many standard benchmarks: ASR, speech-emotion recognition, vocal sound classification, speech translation etc.

Downloads: 0 This Week

Last Update: 2025-09-23
See Project
11

Vidi2

Large Multimodal Models for Video Understanding and Editing

...Vidi targets applications like intelligent video editing, automated video search, content analysis, and editing assistance, enabling users to efficiently locate relevant segments and objects in hours-long footage. The system is built with open-source release in mind, giving developers access to model code, inference scripts, and evaluation pipelines so they can reproduce research results or integrate Vidi into their own video-processing workflows.

Downloads: 0 This Week

Last Update: 2026-03-04
See Project
12

MiniMax-M2

MiniMax-M2, a model built for Max coding & agentic workflows

...The model is tuned for end-to-end developer flows such as multi-file edits, compile–run–fix loops, and test-validated repairs across real repositories and diverse programming languages. It is also optimized for multi-step agent tasks, planning and executing long toolchains that span shell commands, browsers, retrieval systems, and code runners. Benchmarks show that it achieves highly competitive scores on a wide range of intelligence and agent benchmarks, including SWE-Bench variants, Terminal-Bench, BrowseComp, GAIA, and several long-context reasoning suites.

Downloads: 0 This Week

Last Update: 2025-12-01
See Project
13

Z80-μLM

Z80-μLM is a 2-bit quantized language model

...A key deliverable is producing CP/M-compatible .COM binaries, enabling a genuinely vintage “chat with your computer” experience on real hardware or accurate emulators. The project sits at the intersection of machine learning and systems constraints, showing how model architecture, quantization, and inference code generation can be adapted to extreme memory and compute limits. It also functions as an educational reference for how to reduce inference to operations that fit an old-school instruction set and runtime environment.

1 Review

Downloads: 1 This Week

Last Update: 2026-01-27
See Project
14

Oasis

Inference script for Oasis 500M

Open-Oasis provides inference code and released weights for Oasis 500M, an interactive world model that generates gameplay frames conditioned on user keyboard input. Instead of rendering a pre-built game world, the system produces the next visual state via a diffusion-transformer approach, effectively “imagining” the world response to your actions in real time. The project focuses on enabling action-conditional frame generation so developers can experiment with interactive, model-generated environments rather than static video generation alone. ...

Downloads: 1 This Week

Last Update: 2026-01-06
See Project
15

Granite 3.0 Language Models

New set of lightweight state-of-the-art, open foundation models

...The repo positions the models for both research and commercial use under an Apache-2.0 license, signaling permissive adoption paths. Documentation highlights the capability mix (reasoning, tool use, code) and points to model artifacts and guidance for evaluation. Activity on the project shows an evolving codebase with open pull requests and standard GitHub project structure for issues and security visibility. In practice, this is a hub for acquiring Granite 3.0 variants and understanding how to integrate them into applications.

Downloads: 1 This Week

Last Update: 2025-10-08
See Project
16

DreamCraft3D

Official implementation of DreamCraft3D

...The name suggests a “dream crafting” metaphor—users probably supply textual or image prompts and generate 3D assets (point clouds, meshes, scenes). The repository includes model code, inference scripts, sample prompts, and possibly dataset preparation pipelines. It may integrate rendering or post-processing modules (e.g. mesh smoothing, texturing) to make the outputs more output-ready. Because 3D generation is hardware‐intensive, the repository likely also includes optimizations like quantization, pruning, or inference accelerations (e.g. using FlashMLA or DeepEP) to make the generation pipeline faster or more efficient. ...

Downloads: 1 This Week

Last Update: 2025-10-03
See Project
17

Tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models

...It also offers extension mechanisms so that custom encodings can be registered. Internally, it includes the core tokenizer logic (often implemented in Rust or efficient lower-level code), APIs for encoding, decoding, and counting tokens, and binding layers to Python (and sometimes other languages) for easy use.

Downloads: 1 This Week

Last Update: 2025-10-06
See Project
18

HunyuanVideo-I2V

A Customizable Image-to-Video Model based on HunyuanVideo

...It extends video generation so that given a static reference image plus an optional prompt, it generates a video sequence that preserves the reference image’s identity (especially in the first frame) and allows stylized effects via LoRA adapters. The repository includes pretrained weights, inference and sampling scripts, training code for LoRA effects, and support for parallel inference via xDiT. Resolution, video length, stability mode, flow shift, seed, CPU offload etc. Parallel inference support using xDiT for multi-GPU speedups. LoRA training / fine-tuning support to add special effects or customize generation.

Downloads: 1 This Week

Last Update: 2026-04-07
See Project
19

Gemma in PyTorch

The official PyTorch implementation of Google's Gemma models

...The repository demonstrates text generation pipelines, tokenizer setup, quantization paths, and adapters for low-rank or parameter-efficient fine-tuning. Example notebooks walk through instruction tuning and evaluation so teams can benchmark and iterate rapidly. The code is organized to be legible and hackable, exposing attention blocks, positional encodings, and head configurations. With standard PyTorch abstractions, it integrates easily into existing training loops, loggers, and evaluation harnesses.

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
20

Open Infra Index

Production-tested AI infrastructure tools

...FlashMLA, DeepEP, DeepGEMM, 3FS, etc.) that together form DeepSeek’s infrastructure stack. The repo's README describes the project as sharing “humble building blocks” of their online service—code that is documented, deployed, and battle-tested in production. The timing of its opening matches DeepSeek’s “Open-Source Week” campaign (starting around February 2025) when they gradually released internal infrastructure components publicly. It is licensed under CC0-1.0 (Creative Commons Zero) to maximize openness.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
21

MedGemma

Collection of Gemma 3 variants that are trained for performance

MedGemma is a collection of specialized open-source AI models created by Google as part of its Health AI Developer Foundations initiative, built on the Gemma 3 family of transformer models and trained for medical text and image comprehension tasks that help accelerate the development of healthcare-focused AI applications. It includes multiple variants such as a 4 billion-parameter multimodal model that can process both medical images and text and a 27 billion-parameter text-only (and...

Downloads: 0 This Week

Last Update: 2026-04-07
See Project
22

MobileCLIP

Implementation of "MobileCLIP" CVPR 2024

MobileCLIP is a family of efficient image-text embedding models designed for real-time, on-device retrieval and zero-shot classification. The repo provides training, inference, and evaluation code for MobileCLIP models trained on DataCompDR, and for newer MobileCLIP2 models trained on DFNDR. It includes an iOS demo app and Core ML artifacts to showcase practical, offline photo search and classification on iPhone-class hardware. Project notes highlight latency/accuracy trade-offs, with MobileCLIP2 variants matching or surpassing larger baselines at notably lower parameter counts and runtime on mobile devices. ...

Downloads: 0 This Week

Last Update: 6 days ago
See Project
23

OpenAI Realtime Embedded

Instructions on how to use the Realtime API on Microcontrollers

openai-realtime-embedded is a repository that provides resources, SDKs, and example links for using OpenAI’s Realtime API on embedded hardware platforms (e.g. microcontrollers). The goal is to enable low-latency conversational agents (e.g. voice-based assistants) running directly on constrained devices, by leveraging WebRTC and streaming APIs to communicate with OpenAI systems. The repo includes pointers to an ESP32 implementation (maintained as esp32 branch) and documentation that Espressif...

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
24

TimesFM

Pretrained time-series foundation model developed by Google Research

...It provides a decoder-only model approach to forecasting, aiming for strong performance even in zero-shot or low-data settings where traditional models often struggle. The project includes code and an inference API intended to make it practical to run forecasts programmatically, with options to use different backends such as Torch or Flax depending on your environment and performance needs. Newer releases emphasize expanded context handling and more flexible forecasting outputs, including quantile forecasting so users can get uncertainty estimates rather than only point predictions. ...

Downloads: 0 This Week

Last Update: 2026-02-19
See Project
25

MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution

MetaCLIP is a research codebase that extends the CLIP framework into a meta-learning / continual learning regime, aiming to adapt CLIP-style models to new tasks or domains efficiently. The goal is to preserve CLIP’s strong zero-shot transfer capability while enabling fast adaptation to domain shifts or novel class sets with minimal data and without catastrophic forgetting. The repository provides training logic, adaptation strategies (e.g. prompt tuning, adapter modules), and evaluation...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project

Previous
1
2
You're on page 3
4
5
Next

Related Searches

forensic audio analysis

audio voice

Related Categories

Artificial Intelligence

Scientific/Engineering

Multimedia

Security

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Privacy Choices Advertise