Skip to content
@ModelCloud

ModelCloud.ai

Our mission is to give allow everyone, including bots, unlimited and free access to llm/ai models.

Pinned Loading

  1. GPTQModel GPTQModel Public

    LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

    Python 1.1k 168

  2. Device-SMI Device-SMI Public

    Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separate tools such as nvidia-smi or /proc/cpuinfo and parsing it y…

    Python 14 1

Repositories

Showing 10 of 15 repositories
  • GPTQModel Public

    LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

    ModelCloud/GPTQModel’s past year of commit activity
    Python 1,068 168 45 17 Updated Mar 24, 2026
  • Defuser Public

    Model defuser helper for HF Transformers

    ModelCloud/Defuser’s past year of commit activity
    Python 1 Apache-2.0 0 0 0 Updated Mar 24, 2026
  • PyPcre Public
    ModelCloud/PyPcre’s past year of commit activity
    Python 2 Apache-2.0 2 0 1 Updated Mar 23, 2026
  • LogBar Public

    A unified Logger and ProgressBar util with zero dependencies.

    ModelCloud/LogBar’s past year of commit activity
    Python 8 Apache-2.0 0 0 0 Updated Mar 23, 2026
  • Tokenicer Public

    A (nicer) tokenizer you want to use for model inference and training: with all known peventable gotchas normalized or auto-fixed.

    ModelCloud/Tokenicer’s past year of commit activity
    Python 11 Apache-2.0 4 0 0 Updated Mar 15, 2026
  • Device-SMI Public

    Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separate tools such as nvidia-smi or /proc/cpuinfo and parsing it yourself.

    ModelCloud/Device-SMI’s past year of commit activity
    Python 14 Apache-2.0 1 0 1 Updated Dec 12, 2025
  • MemLord Public
    ModelCloud/MemLord’s past year of commit activity
    Python 1 Apache-2.0 0 0 1 Updated Nov 21, 2025
  • lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness

    A framework for few-shot evaluation of language models.

    ModelCloud/lm-evaluation-harness’s past year of commit activity
    Python 0 MIT 3,146 0 0 Updated Apr 17, 2025
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    ModelCloud/vllm’s past year of commit activity
    Python 1 Apache-2.0 14,865 0 0 Updated Mar 27, 2025
  • rockthem Public
    ModelCloud/rockthem’s past year of commit activity
    Cuda 0 Apache-2.0 0 0 0 Updated Mar 13, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…