We pride ourselves on being a modern software development shop. Our managers all come from software engineering backgrounds and have a natural inclination to leverage novel technologies and processes that can help our business succeed. Some highlights of what we offer:
We will consider remote candidates for this role.
We believe in team autonomy. Our teams strive to own their work from development to release and everything in between. This includes DevOps considerations, ensuring quality through tools such as testing automation, optimizing for speed and reliability, etc.
We believe in investing in and pursuing excellence. We reserve time for addressing technical debt, advancing architecture, and providing teams the bandwidth to experiment and iterate.
Work-life balance is a priority. We work on flexible schedules: many of our team members work from home one or two days a week, some arrive at 7:00 am, and others arrive at 9:30. Forty-hour work weeks are the norm.
We believe in career investments. Employees are encouraged to spend time learning new skills of their choosing during work hours. Each of our employees has access to their own PluralSight account.
We view the diversity on our teams as a key strength.
We practice Agile principles. We run two-week sprints, have embedded product ownership on the teams, use Kanban / Scrum, etc. We don't prescribe methodologies, however; we believe that each team should choose the process that best fits it.
We're in the cloud. We have major applications on Microsoft's Azure platform, have MSDN licenses, and each have a generous monthly allowance that can be used to experiment with the platform.
We take hardware seriously. We each have a powerhouse laptop and three monitors at our workstations.
We have a casual dress code.
Each team generally has its own team room.
We strive to create an engaging workplace. There are often board games at lunch, video game tournaments, and mixers held in the office.
We're looking for an engineer to join our Site Reliability (SRE) team. We combine software and systems engineering, security, tooling, debugging, and performance in order to ensure that our software applications are always running optimally and efficiently. The work draws on a broad range of skills and knowledge. We don't expect you to know everything on day one, but we do expect you'll learn and grow in this role as you increase your strengths, versatility, and technical know-how. You'll assist with mitigating some production issues, but largely focus on ways to architect and improve the resiliency and reliability of our applications. We're looking for folks who are pragmatic, eager learners, and committed generalists.
Our offices are in Buffalo, NY, but we will consider remote working arrangements for qualified applicants.
3+ years of hands-on experience with a majority of the following, along with a willingness to learn the remaining areas:
Windows Server and Linux
Designing, analyzing, and troubleshooting distributed systems in the cloud (Azure, AWS, GCP, etc.)
Secrets management with Azure Key Vault or AWS Secrets Manager, or similar systems
Automation and configuration management using PowerShell DSC, Terraform, Chef, etc.
Firewalls and load balancers
Networking including virtual networks and subnets
Application performance monitoring tools such as New Relic
Infrastructure monitoring tools
Command line operations
Security operations with tools for SAST, DAST, RASP, and WAF
C#, Java, or other programming languages
SQL Server, SQL Server clustering
Performance tuning and identifying performance bottlenecks (both infrastructure and code)
Additionally, here are some other expectations of this role:
Demonstrate a systematic approach to solving ambiguous problems, along with a sense of ownership and drive
Demonstrate a commitment to building scalable, maintainable software solutions
Practice sustainable incident response as needed
Debug and optimize code
Pragmatically identify opportunities and execute on automating routine tasks
Identify and share opportunities to improve our processes and practices
Identify and share opportunities to improve our infrastructure and reduce toil
Understand the importance of well-crafted documentation and be able to produce it
Adhere to and advocate for best practices, including Infrastructure as Code, monitoring, high availability, disaster recovery, security, and DevOps methodologies
Contribute to capacity planning, and advise and consult with teams that will be load or stress testing
Keep up with industry innovations, recommending new tools or practices when appropriate
Be willing to occasionally work outside of core business hours to carry out after-hours maintenance or triage production system outages
EOE, including disability/vets at https://cdn-static.findly.com/wp-content/uploads/sites/954/2019/09/Equal-Employment-Opportunities-EEO-Policy.pdf