Skip to content
View Mister-Raggs's full-sized avatar

Highlights

  • Pro

Block or report Mister-Raggs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Mister-Raggs/README.md

Raghav Kachroo

Software engineer focused on distributed systems and operational reliability, currently applying that lens to AI infrastructure.

At Amazon I worked on high-throughput log indexing and on-call automation. Before that, I built document intelligence systems at Aark Global — async ingestion pipelines, OCR, search, and datastore migration — and earlier, streaming data infrastructure at Concentrix (Kafka, Airflow, Elasticsearch at scale). I'm currently at UCSD's Hao AI Lab, contributing to FastVideo, an open-source video-diffusion framework. Recent shipped work: a PostDecode optimization that cut Cosmos 2.5 end-to-end latency by 15.6% (cross-validated on H100), torch.compile documentation surfacing a hidden 24% speedup, the Cosmos 2.5 training pipeline (LoRA + full fine-tune, merged), and FA2/FA3 as a torch.library custom_op — the traceable-node enabler for downstream graph-break work. Current focus: real backward registration for FlashAttention under torch.compile and the broader inference-optimization loop.

Projects worth looking at:

log-triage-v2 — Go log indexing engine, a portable rewrite of an internal Amazon tool. Time-sorted index with binary search (49ns at 1k entries, 77ns at 1M — cache-dominated, not comparison-bound), channel-based worker pool for parallel ingestion at 42M rows/hour, write-ahead log with batched fsync, Prometheus + OpenTelemetry instrumentation.

Flare — End-to-end ML log anomaly detection pipeline. Drain3 template mining + multi-model scoring (Isolation Forest, LOF, One-Class SVM) + DBSCAN incident clustering + Claude summarization. 0.642 F1 at ~40ms CPU-only latency on the full HDFS dataset (11.2M lines), LLM-as-judge eval scoring 4.67/5 without ground truth labels, MLflow experiment tracking across runs.

Writing: mister-raggs.github.io
LinkedIn: linkedin.com/in/raghavkachroo

Pinned Loading

  1. hao-ai-lab/FastVideo hao-ai-lab/FastVideo Public

    A unified inference and post-training framework for accelerated video generation.

    Python 3.7k 369

  2. flare flare Public

    AIOps pipeline for log anomaly detection — Drain3 parsing, Isolation Forest scoring, DBSCAN incident clustering, LLM-powered summaries via Claude, and an LLM-as-judge eval harness. FastAPI + dashbo…

    Python

  3. log-triage-v2 log-triage-v2 Public

    Go log query service for incident triage — sub-millisecond nearest-timestamp lookup on a time-sorted in-memory index, built for 42M rows/hour. Kubernetes-native with Prometheus instrumentation.

    Go

  4. Parade Parade Public

    Pipeline that monitors VC funding news to surface recently-funded companies likely to be hiring. Feeds the top of a targeted job search funnel.

    Python

  5. JobHunter JobHunter Public

    Multi-ATS job scraper with a unified abstraction over Greenhouse, Ashby, Lever, and Apple careers — SQLite deduplication, email delivery

    Python

  6. Mister-Raggs.github.io Mister-Raggs.github.io Public

    Portfolio / blog / personal website powered by Astro

    Astro