Loading...
1 open positions
Showing 1-1 of 1 matching jobs.
Machine Learning Engineer | Python | Pytorch | Distributed Training | Optimisation | GPU | Hybrid, San Jose, CA Title: Machine Learning Engineer Location: San Jose, CA Responsibilities: Productize and optimize models from Research into reliable, performant, and cost-efficient services with clear SLOs (latency, availability, cost). Scale training across nodes/GPUs (DDP/FSDP/ZeRO, pipeline/tensor parallelism) and own throughput/time-to-train using profiling and optimization. Implement model-efficiency techniques (quantization, distillation, pruning, KV-cache, Flash Attention) for training and ...