Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
NetCraftsmen, now BlueAlly

Senior AI Engineer

NetCraftsmen, now BlueAlly

Senior AI Engineer designing and operating enterprise AI systems across client portfolios for BlueAlly. Leading end-to-end AI solutions and engaging directly with clients throughout the process.

Posted 6/7/2026full-timeRemote • 🇺🇸 United StatesSenior💰 $180,000 - $200,000 per yearWebsite

Tech Stack

Tools & technologies
DNSDockerKubernetesLinuxPythonTCP/IP

About the role

Key responsibilities & impact
  • Design, build, and operate enterprise AI systems across our client portfolio.
  • Work end-to-end across the AI stack — from inference engines and platform infrastructure up through application-level engineering.
  • Lead end-to-end design, build, and operation of AI systems on AI Factory platforms across multiple client engagements.
  • Engineer and tune LLM inference serving stacks — primary depth in vLLM with breadth across the inference ecosystem — for client latency, throughput, and cost targets.
  • Tune inference performance through KV cache management, paged attention, batching strategies, and Dynamo-based disaggregated serving.
  • Architect and operate MLOps pipelines covering model lifecycle, registries, deployment, rollback, and observability.
  • Design and engineer RAG applications on top of vector databases.
  • Build and tune prompt-engineering patterns at production scale.
  • Engineer high-performance storage and networking for AI workloads.
  • Operate Kubernetes clusters underpinning AI workloads.
  • Build and maintain container images, registries, and CI/CD pipelines for AI/ML services.
  • Implement monitoring, alerting, logging, and capacity planning across the AI stack.
  • Harden environments to meet client security and compliance requirements.
  • Lead troubleshooting across various environments and technologies.
  • Engage directly with client stakeholders — technical and executive — to communicate status, root cause, options, and recommendations.
  • Mentor and code-review work from less senior engineers; raise the technical bar of every engagement you join.
  • Author runbooks, reference architectures, and knowledge base content; lead client knowledge transfer and enablement sessions.
  • Participate in on-call rotation and incident response for production AI workloads.
  • Contribute reusable patterns, tooling, and reference designs back to the practice.

Requirements

What you’ll need
  • 7+ years of software, data, or infrastructure engineering, with 3+ years specifically working with modern AI / LLM systems.
  • Production-quality Python at engineering level — testing, code review, version control fluency, and shipping code that other engineers depend on.
  • Deep production Linux experience, including system internals, performance tuning, and troubleshooting.
  • Deep proficiency with Docker — image build, registry management, runtime tuning, and container security.
  • Strong server-platform skills including CPU/GPU topologies, PCIe, BMC management, BIOS/firmware lifecycle, and physical-to-logical troubleshooting.
  • Hands-on experience deploying and operating one or more of HPE PCAI, Dell AI Factory, or Nutanix Enterprise AI.
  • Production experience deploying, tuning, and operating vLLM.
  • Working knowledge of multiple inference and model-serving frameworks beyond vLLM, with the ability to choose and tune the right tool for each workload.
  • Hands-on experience with high-throughput, low-latency storage and network fabrics for AI workloads — including RDMA-class interconnects, parallel/object storage tiers, KV cache management, and Dynamo-style disaggregated serving.
  • Practical experience operating MLOps tooling and patterns — model registries, deployment pipelines, GitOps, lineage, and rollback.
  • Hands-on experience deploying, tuning, and integrating vector databases and RAG pipelines, including the application-level engineering that sits on top of them.
  • Production experience designing system prompts, structured output, function calling, and tool-using LLM patterns.
  • Demonstrated experience designing LLM evaluation harnesses — golden sets, regression suites, and quality/cost metrics.
  • Demonstrated ability to engage directly with client stakeholders — running working sessions, presenting recommendations, and translating technical detail for non-technical audiences.
  • Strong written and verbal communication — clear reference architectures, runbooks, and incident reports.
  • Track record of mentoring more junior engineers and raising team technical quality through code review and pairing.
  • TCP/IP, DNS, load balancing, VLANs, and firewall administration.
  • Comfort working across multiple concurrent client environments and managing competing priorities under SLA.

Benefits

Comp & perks
  • 🌐 Worldwide ❌ Jobs You've Hidden ⭐️ Saved Jobs ✅ Applied Jobs ✉️ Email Alerts 👤 Account NetCraftsmen, now BlueAlly Website LinkedIn All Job Openings 51 - 200 employees NetCraftsmen, was acquired by BlueAlly in March 2022. At BlueAlly, our mission is to make technology more accessible, more certain, and more impactful for every organization. Senior AI Engineer Job not on LinkedIn 🔥 9 minutes ago 🇺🇸 United States – Remote 💵 $180k - $200k / year ⏰ Full Time 🟠 Senior 🤖 AI Engineer DNS Docker Kubernetes Linux Python TCP/IP Apply Now Find Hiring Managers Customize resume + cover letter Report problem ☆ Save ☑️ Mark as applied ❌ Hide 📋 Description
  • Design, build, and operate enterprise AI systems across our client portfolio.
  • Work end-to-end across the AI stack — from inference engines and platform infrastructure up through application-level engineering.
  • Lead end-to-end design, build, and operation of AI systems on AI Factory platforms across multiple client engagements.
  • Engineer and tune LLM inference serving stacks — primary depth in vLLM with breadth across the inference ecosystem — for client latency, throughput, and cost targets.
  • Tune inference performance through KV cache management, paged attention, batching strategies, and Dynamo-based disaggregated serving.
  • Architect and operate MLOps pipelines covering model lifecycle, registries, deployment, rollback, and observability.
  • Design and engineer RAG applications on top of vector databases.
  • Build and tune prompt-engineering patterns at production scale.
  • Engineer high-performance storage and networking for AI workloads.
  • Operate Kubernetes clusters underpinning AI workloads.
  • Build and maintain container images, registries, and CI/CD pipelines for AI/ML services.
  • Implement monitoring, alerting, logging, and capacity planning across the AI stack.
  • Harden environments to meet client security and compliance requirements.
  • Lead troubleshooting across various environments and technologies.
  • Engage directly with client stakeholders — technical and executive — to communicate status, root cause, options, and recommendations.
  • Mentor and code-review work from less senior engineers; raise the technical bar of every engagement you join.
  • Author runbooks, reference architectures, and knowledge base content; lead client knowledge transfer and enablement sessions.
  • Participate in on-call rotation and incident response for production AI workloads.
  • Contribute reusable patterns, tooling, and reference designs back to the practice. 🎯 Requirements
  • 7+ years of software, data, or infrastructure engineering, with 3+ years specifically working with modern AI / LLM systems.
  • Production-quality Python at engineering level — testing, code review, version control fluency, and shipping code that other engineers depend on.
  • Deep production Linux experience, including system internals, performance tuning, and troubleshooting.
  • Deep proficiency with Docker — image build, registry management, runtime tuning, and container security.
  • Strong server-platform skills including CPU/GPU topologies, PCIe, BMC management, BIOS/firmware lifecycle, and physical-to-logical troubleshooting.
  • Hands-on experience deploying and operating one or more of HPE PCAI, Dell AI Factory, or Nutanix Enterprise AI.
  • Production experience deploying, tuning, and operating vLLM.
  • Working knowledge of multiple inference and model-serving frameworks beyond vLLM, with the ability to choose and tune the right tool for each workload.
  • Hands-on experience with high-throughput, low-latency storage and network fabrics for AI workloads — including RDMA-class interconnects, parallel/object storage tiers, KV cache management, and Dynamo-style disaggregated serving.
  • Practical experience operating MLOps tooling and patterns — model registries, deployment pipelines, GitOps, lineage, and rollback.
  • Hands-on experience deploying, tuning, and integrating vector databases and RAG pipelines, including the application-level engineering that sits on top of them.
  • Production experience designing system prompts, structured output, function calling, and tool-using LLM patterns.
  • Demonstrated experience designing LLM evaluation harnesses — golden sets, regression suites, and quality/cost metrics.
  • Demonstrated ability to engage directly with client stakeholders — running working sessions, presenting recommendations, and translating technical detail for non-technical audiences.
  • Strong written and verbal communication — clear reference architectures, runbooks, and incident reports.
  • Track record of mentoring more junior engineers and raising team technical quality through code review and pairing.
  • TCP/IP, DNS, load balancing, VLANs, and firewall administration.
  • Comfort working across multiple concurrent client environments and managing competing priorities under SLA. Apply Now 📊 Check your resume score for this job Improve your chances of getting an interview by checking your resume score before you apply. Check Resume Score Similar Jobs Senior AI Engineer 🔥 14 hours ago Anaplan 1001 - 5000 ☁️ SaaS 🏢 Enterprise 💸 Finance Website LinkedIn All Job Openings Senior AI Engineer developing and deploying Generative AI and Machine Learning systems. Owning architecture and collaborating with teams to build innovative capabilities at Anaplan. 🇺🇸 United States – Remote 💰 Secondary Market on 2018-03 ⏰ Full Time 🟠 Senior 🤖 AI Engineer 🦅 H1B Visa Sponsor Python Lead AI Engineer, Business Operations 🕒 Yesterday AFL 1001 - 5000 📡 Telecommunications 🔧 Hardware ⚡ Energy Website LinkedIn All Job Openings Lead AI Engineer responsible for developing agentic AI systems at AFL. Working within Business Operations to automate operational processes through innovative AI solutions. 🇺🇸 United States – Remote ⏰ Full Time 🟠 Senior 🤖 AI Engineer 🦅 H1B Visa Sponsor Cloud Python AI Engineer – Architect 🕒 Yesterday General Dynamics Information Technology 10,000+ employees 🔒 Cybersecurity 🤖 Artificial Intelligence Website LinkedIn All Job Openings AI Engineer/Architect leading design and architecture of next-gen AI/ML solutions for GDIT. Collaborating with business development and providing innovative AI/ML solutions for complex challenges. 🇺🇸 United States – Remote 💵 $153k - $207k / year ⏰ Full Time 🟠 Senior 🔴 Lead 🤖 AI Engineer 🦅 H1B Visa Sponsor Senior Technical Product Manager, AI Engineering – Systems 🕒 Yesterday Jerry 201 - 500 💳 Fintech Website LinkedIn All Job Openings Lead AI product development for Jerry.ai, transforming car ownership with automation and integrated services. Collaborate with engineering and OpenAI for innovative solutions. 🇺🇸 United States – Remote 💵 $160k - $200k / year ⏰ Full Time 🟠 Senior 🤖 AI Engineer 🦅 H1B Visa Sponsor People Analytics AI Engineer 🕒 Yesterday Samsara 1001 - 5000 🏢 Enterprise 🚗 Transport 🔐 Security Website LinkedIn All Job Openings Staff AI Engineer within the People team leading HR workflows and AI initiatives at Samsara. Designing secure applications to transform manual HR work into efficient solutions. 🇺🇸 United States – Remote 💵 $146.4k - $221.4k / year 💰 Seed Round on 2014-08 ⏰ Full Time 🟠 Senior 🔴 Lead 🤖 AI Engineer 🦅 H1B Visa Sponsor NoSQL Python SQL View More AI Engineer Jobs 🌐 Worldwide Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com Search Search Jobs by country Search jobs by city Search jobs by job title Search entry-level jobs Search junior-level jobs Search senior-level jobs Search jobs by tech stack Search jobs by contract type Search remote internships Search remote part-time jobs Remote jobs Anywhere in the World Companies Hiring Anywhere in the World Companies Hiring Sales People Anywhere in the World Companies Hiring Software Engineers Anywhere in the World Resources Advice Tips for finding remote jobs Interview questions and answers Resume examples Cover letter examples Post a job Affiliates Privacy policy Terms of service Job board SEO course AI Apply Copilot OpenClaw job finder Jobs by Country Remote jobs anywhere in the world (Worldwide remote jobs) Remote jobs United States Remote jobs Australia Remote jobs Brazil Remote jobs Canada Remote jobs France Remote jobs Ireland Remote jobs Germany Remote jobs Netherlands Remote jobs Spain Remote jobs UK Popular Jobs Remote data analyst jobs Remote customer support jobs Remote executive assistant jobs Remote marketing jobs Remote product designer jobs Remote product manager jobs Remote project manager jobs Remote recruiter jobs Remote sales jobs Remote software engineer jobs Jobs by Type Remote full-time jobs Remote part-time jobs Remote contract jobs Remote internship jobs Remote entry-level jobs Remote jobs with no experience required Remote junior jobs (1-3 years of experience) Digital nomad jobs Remote jobs with no degree required Freelance remote jobs Temporary remote jobs Remote jobs hiring now Stay at home mom jobs

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
PythonLinuxDockervLLMMLOpsvector databasesRAG applicationsinference engineshigh-throughput storagelow-latency networking
Soft Skills
communicationmentoringtroubleshootingclient engagementtechnical writingteam collaborationproblem-solvingcode reviewpresentation skillsprioritization