The Codest

Research Engineer – RL Environments

The Codest

full-time

Posted on:

Location Type: Remote

Location: Poland

Visit company website

Explore more

AI Apply
Apply

Salary

💰 PLN 34,000 - PLN 44,000 per month

About the role

  • Design and build MLE/SWE environments and diverse tasks.
  • Target a specified language model and satisfy the required difficulty distribution.

Requirements

  • Experience with PyTorch or JAX at the framework level (not just importing a model)
  • Familiarity with RL concepts: reward functions, environment design, training loops, evaluation
  • Ability to read ML papers and implement them. This is a core part of the job.
  • Production Python skills: Docker, git, clean code, reproducible environments.
  • Exposure to any of: model training/finetuning, inference optimization, CUDA/Triton kernels, distributed training, model internals (attention, KV caches, tokenizers)
Benefits
  • 100% remote work
  • 300 PLN to use on our benefits platform, Worksmile - gift cards, medical services, sports, etc.
  • Integration events, education opportunities and much more…
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
PyTorchJAXreinforcement learningreward functionsenvironment designtraining loopsevaluationPythonCUDAdistributed training
Soft Skills
ability to read ML papersimplementation skills