A CUDA implementation of the transpose-free Quasi-Minimal Residual method
Updated Sep 2, 2025 - C++
The definitive Strix Halo LLM guide — 65 t/s on a $2,999 mini PC. Live benchmarks, tested optimizations, and everything that doesn't work.
Unified Memory Abstraction Layer for AI Inference on AMD APUs and Intel iGPUs
Fundamentals of Accelerated Computing C/C++ is a course provided by NVIDIA.
Performance comparison of two different forms of memory management in CUDA
NVML unified memory shim for NVIDIA DGX Spark Grace Blackwell GB10 - enables MAX Engine, PyTorch, and GPU monitoring
Talos-O (Omni): A sovereign, embodied agentic organism forged on AMD Strix Halo. Integrating the Chimera Kernel (Linux 7.0), Zero-Copy Introspection, and the Phronesis Engine. Built from First Principles.
GPU-thrashing diagnostic tool for NVIDIA Unified Memory — architecture-aware, measurement-based, with PCIe/coherent transport detection
3D U-Net with tf.keras for Large-Model-Support or Unified Memory
Apple Silicon Unified Memory for GPU-Accelerated Analytics — TPC-H benchmarks across DuckDB, NumPy, and MLX
Reproducible Pascal GPU Unified Memory benchmark with Nsight and nvprof profiling
Extended the UVM benchmark to test huge data workloads (16 GiB and more): made it overflow-safe and added dataset-creation logic for some applications.
AI-native OS kernel written from scratch in C and x86_64/aarch64 assembly — kernel-level tensor compute, capability-based security, SMP, TCP/IP, and 95 userland programs
NVIDIA GPU validation: PCIe transport, Unified Memory prefetch, SGEMM compute, drift detection.
Local inference server for Apple Silicon — hot-swaps MLX models (LLM, vision, embeddings, TTS, STT) via OpenAI API
Run LLMs larger than your RAM — native GGUF inference engine with SSD streaming, no GPU required
Cycle-accurate UMA fault latency and bandwidth measurement for NVIDIA GPUs. C and PTX. No Python. Pascal (SM 6.0) through Blackwell GB10 (SM 12.1).
3-bit Lloyd-Max KV Cache Compression for LLM Inference on NVIDIA DGX Spark GB10 — 5.12x compression, 0.983 cosine similarity, pure numpy on ARM unified memory
This project provides an overview of how to program a GPU: how can we exploit Unified Memory, and is it a real competitor to pinned memory?
Unlock fast, local LLM inference on AMD-powered mini PCs delivering 65-87 t/s for large models without cloud or subscription costs
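Several of the projects above benchmark CUDA unified (managed) memory against explicitly pinned host memory. As a minimal sketch of the two allocation paths they compare (assuming a CUDA-capable toolchain; the kernel, sizes, and launch parameters here are illustrative and not taken from any listed repository):

```cuda
// Sketch: managed memory vs. pinned host memory + explicit copies.
#include <cuda_runtime.h>
#include <cstdio>

__global__ void scale(float *x, float a, size_t n) {
    size_t i = blockIdx.x * (size_t)blockDim.x + threadIdx.x;
    if (i < n) x[i] *= a;
}

int main() {
    const size_t n = 1 << 20;
    const size_t bytes = n * sizeof(float);

    // Path 1: unified (managed) memory -- one pointer visible to host
    // and device; pages migrate on demand (or via cudaMemPrefetchAsync).
    float *m;
    cudaMallocManaged(&m, bytes);
    for (size_t i = 0; i < n; ++i) m[i] = 1.0f;
    scale<<<(n + 255) / 256, 256>>>(m, 2.0f, n);
    cudaDeviceSynchronize();      // host may safely touch m again after this
    printf("managed: m[0] = %f\n", m[0]);
    cudaFree(m);

    // Path 2: pinned (page-locked) host memory plus explicit device copies.
    // Pinning enables fast DMA transfers at the cost of manual staging.
    float *h, *d;
    cudaMallocHost(&h, bytes);
    cudaMalloc(&d, bytes);
    for (size_t i = 0; i < n; ++i) h[i] = 1.0f;
    cudaMemcpy(d, h, bytes, cudaMemcpyHostToDevice);
    scale<<<(n + 255) / 256, 256>>>(d, 2.0f, n);
    cudaMemcpy(h, d, bytes, cudaMemcpyDeviceToHost);
    printf("pinned:  h[0] = %f\n", h[0]);
    cudaFree(d);
    cudaFreeHost(h);
    return 0;
}
```

Which path wins depends on access pattern and hardware: on-demand page migration can lose badly under GPU thrashing, while prefetched managed memory often approaches pinned-copy bandwidth — which is exactly what the benchmark repositories above measure.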