
Machine Learning Systems Research Intern
Red Hat
internship
Posted on:
Location Type: Hybrid
Location: Boston • Massachusetts • United States
Visit company websiteExplore more
Job Level
About the role
- Research and implement techniques for LLM inference and LLM optimizations.
- Conduct experiments to evaluate the impact of optimization methods on model accuracy, latency, and throughput.
- Collaborate with researchers and engineers to integrate optimizations into real-world machine learning workflows.
- Document findings and contribute to technical reports, blog posts, or research publications.
Requirements
- Currently pursuing a Ph.D. degree in Computer Science, Electrical Engineering, Machine Learning, or a related field
- Strong programming skills in C++, CUDA, and Python
- Experience with tensor math libraries such as PyTorch
- Familiarity with AI model optimization techniques such as quantization (e.g., INT4, FP8), pruning, and knowledge distillation
- Deep understanding and experience in GPU performance optimizations
- Excellent knowledge of large language model architectures
- Strong analytical and problem-solving skills
- Excellent communication skills and ability to work in a team-oriented research environment
- Background in efficient inference techniques for large-scale language models or computer vision models
- Prior experience contributing to open-source ML frameworks or research publications
- 1 or more co-authored papers at a top tier conference like NeurIPS, ICLR, ACL, CVPR, MLSys is a big plus.
Benefits
- Competitive stipend
- Mentorship from leading experts in machine learning and model efficiency
- Opportunity to contribute to research papers, patents, or open-source projects
- Flexible work arrangements
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
C++CUDAPythonPyTorchquantizationpruningknowledge distillationGPU performance optimizationslarge language model architecturesefficient inference techniques
Soft Skills
analytical skillsproblem-solving skillscommunication skillsteamwork
Certifications
Ph.D. in Computer SciencePh.D. in Electrical EngineeringPh.D. in Machine Learning