
Site Reliability Engineer
Motorola Solutions
full-time
Posted on:
Location Type: Remote
Location: Brazil
Visit company websiteExplore more
About the role
- Diagnose complex, intermittent, and high-impact issues to maintain system stability
- Research and utilize advanced diagnostic tools to troubleshoot ongoing customer issues within live production environments
- Identify single points of failure in the architecture to re-design systems for maximum redundancy and auto-recovery
- Analyze application source code in Java and Angular to identify memory leaks, race conditions, or inefficient logic
- Propose and implement code fixes directly to improve long-term system reliability rather than simply filing bug tickets
- Adjust kernel parameters and network stack configurations to optimize low-level system performance
- Build internal tooling to empower other engineering teams to self-serve their infrastructure needs
- Develop high-quality automation to ensure that manually solved problems are never repeated
- Tweak database queries and application thread pools to tune the performance of the entire software stack
- Serve as a critical member of the on-call rotation to respond to and mitigate major system outages
- Lead incident command efforts during high-pressure situations to restore service and protect critical data flows
- Conduct post-incident reviews to convert outages into actionable architectural improvements
Requirements
- 5+ years of experience with Java or Angular to debug and patch application-level reliability issues
- 5+ years of experience with Linux Internals to tune kernel parameters and network stack configurations
- Advanced English Proficiency
- Expertise in Infrastructure as Code (IaC) to build automated, repeatable environments
- Expertise in Database Optimization to refine complex queries and improve data retrieval speeds
- 5+ years of experience in a Site Reliability Engineering or DevOps role to manage high-availability production environments
- 5+ years of experience with Cloud Infrastructure to design resilient and scalable architectures
- Experience in Incident Command to lead the resolution of mission-critical system outages
- Bachelor’s degree in Computer Science, Software Engineering, or a related technical field
Benefits
- Health insurance
- Flexible work arrangements
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
JavaAngularLinux InternalsInfrastructure as Code (IaC)Database OptimizationCloud InfrastructureAutomationKernel parameters tuningNetwork stack configurationsApplication performance tuning
Soft Skills
Advanced English ProficiencyIncident Command leadershipProblem-solvingCritical thinkingCollaborationCommunicationAdaptabilityResilienceLeadershipPost-incident review
Certifications
Bachelor’s degree in Computer ScienceBachelor’s degree in Software Engineering