Provide production support on a shift according to the team on-call roster.
Work on the customer and internal engineering/implementation team raised tickets while not on-call for production support.
Continuously monitor the health and performance of our services, systems, and infrastructure.
Granicus builds and maintains technology that is transforming the Govtech industry by bringing governments and its constituents together. They serve 5,500 federal, state, and local government agencies and more than 300 million citizen subscribers, and are known for being one of the best companies to work for.
Build and maintain Python fleet tracking system that manages the full lifecycle of servers.
Build server management tooling that automates provisioning, health checks, GPU diagnostics, recovery and alerting.
Create and maintain metrics, dashboards, and alerting for hardware health across the fleet.
FAL is committed to keeping a large fleet of GPU servers healthy and productive. They offer a collaborative and supportive culture with learning and growth opportunities.
Administer and support cloud-native infrastructure powering Telecommunication systems, with a strong focus on AWS-hosted services.
Perform day-to-day system administration tasks within AWS GovCloud, including provisioning, configuring, monitoring, and patching Linux-based virtual machines.
Monitor cloud system performance using CloudWatch, CloudTrail, and Splunk, diagnosing and resolving infrastructure issues to maintain 24/7 uptime.
TekSynap is a fast-growing high-tech company that understands the pace of technology and the need for a comprehensive information management environment. They aim to utilize the best of information technology to meet the business needs of Federal Government customers.