
Research Engineer – RL Environments
The Codest
full-time
Posted on:
Location Type: Remote
Location: Poland
Visit company websiteExplore more
Salary
💰 PLN 34,000 - PLN 44,000 per month
About the role
- Design and build MLE/SWE environments and diverse tasks.
- Target a specified language model and satisfy the required difficulty distribution.
Requirements
- Experience with PyTorch or JAX at the framework level (not just importing a model)
- Familiarity with RL concepts: reward functions, environment design, training loops, evaluation
- Ability to read ML papers and implement them. This is a core part of the job.
- Production Python skills: Docker, git, clean code, reproducible environments.
- Exposure to any of: model training/finetuning, inference optimization, CUDA/Triton kernels, distributed training, model internals (attention, KV caches, tokenizers)
Benefits
- 100% remote work
- 300 PLN to use on our benefits platform, Worksmile - gift cards, medical services, sports, etc.
- Integration events, education opportunities and much more…
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PyTorchJAXreinforcement learningreward functionsenvironment designtraining loopsevaluationPythonCUDAdistributed training
Soft Skills
ability to read ML papersimplementation skills