Swoop Search

Software Engineer – AI Infrastructure

Swoop Search

full-time

Posted on:

Location Type: Hybrid

Location: Minneapolis-St. PaulMinnesotaWashingtonUnited States

Visit company website

Explore more

AI Apply
Apply

About the role

  • Develop and maintain Swoop’s LLM offering
  • Expand the capabilities of the LLM to interact with the system via tool calls
  • Expand the data searching capabilities of the LLM
  • Work hand-in-hand with frontend developers to build out new LLM features and improve existing ones
  • Monitor the resource usage of installations and make sure that the LLM offering is as efficient and fast as possible given the inference hardware available
  • Maintain and optimize inference engine architecture
  • Tune data storage configurations to optimize for scale and near real-time availability in a streaming architecture
  • Ensure our services have strong availability and service level agreements across our code base, especially as it pertains to the runtime of our Kubernetes cluster in production

Requirements

  • Bachelor's degree in Computer Science or related technical field, or equivalent technical experience
  • Firm understanding of scalable large language model infrastructure
  • Experience with low-level NVIDIA drivers and NVIDIA Kubernetes Container Toolkit
  • Familiarity with designing RAG information retrieval systems and time-series anomaly detection
  • Experience with PyTorch, training and fine-tuning Machine Learning models for resource-light environments
  • Experience with Kubernetes in a production environment
  • Proficient Python coding ability with good understanding of data structures and data models
  • Active US Security clearance or ability and willingness to be sponsored for a US Security clearance.
Benefits
  • Swoop Technologies is proud to be an equal opportunity employer.
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
large language model infrastructureNVIDIA driversNVIDIA Kubernetes Container ToolkitRAG information retrieval systemstime-series anomaly detectionPyTorchMachine Learning modelsPythondata structuresdata models
Certifications
Bachelor's degree in Computer ScienceUS Security clearance