AI Inference
Performance Engineer

Building faster inference through latency benchmarks, runtime validation, and GPU-aware systems

CUDA C++ PyTorch Model Serving Latency Benchmarking Runtime Systems

Suraj profile photo

EXPERIENCE

Real-Time Simulation Systems at Boeing

  • Built real-time simulation systems in Unity + C++ modeling aircraft electrical systems and faults
  • Designed modular, state-driven architectures and optimized CPU, memory, and rendering performance
C++ Integration • Performance Optimization

Infrastructure at Prisms VR

  • Built TeamCity PR gates across 3 Unity repositories, blocking failed builds before merge.
  • Automated package sync, lockfile regeneration, CI triage, and release validation workflows.
TeamCity • CI Validation • Release Automation
Let’s Make Inference Faster
surajm99@outlook.com