Linux Operations Engineer

  • Contract

Company Description

Artech is the 10th Largest IT Staffing Company in the US, according to Staffing Industry Analysts' 2012 annual report. Artech provides technical expertise to fill gaps in clients' immediate skill-sets availability, deliver emerging technology skill-sets, refresh existing skill base, allow for flexibility in project planning and execution phases, and provide budgeting/financial flexibility by offering contingent labor as a variable cost.

Job Description

We are looking for a candidate with a strong background in infrastructure automation to help us design, implement, and maintain software build engineering processes, product testing and staging environments, and a Linux-based infrastructure.  
 
We're looking for someone that is enthusiastic about new technologies, who's excited about working closely with a small group of engineers and developers, who enjoys taking ownership of their work, who wants to help us think about and design new infrastructure solutions, who wants to introduce new technology ideas, and who wants to learn.
 
Responsibilities 
 Primarily responsible for the day to day creation, deployment, support and maintenance of development software stacks, overseeing build frameworks, repackaging RPM dependencies, improving OS distributions, writing configuration management code, and building software tools to support infrastructure provisioning.
 
Required Experience
 The ideal candidate will have a minimum of 1 year, direct and hands-on experience in a medium to large Linux based environment supporting development activities.  Specific required experience includes:
 
Linux systems administration – (CentOS, Redhat)
Storage (preferably NAS)
Configuration management and automation using tools such as Puppet, Chef, Ansible.
Virtualization platforms (VMWare and OpenStack) Networking (primarily layers 1 – 3), protocols and standard tools such as tcpdump and traceroute
Python or Ruby software development
Shell scripting (bash/sed/awk/etc.)
RPM packaging and package management
Git source code management
Continuous Integration Frameworks (e.g. Jenkins)
Development technology (e.g. PHP, RoR, Python, Java, Groovy, etc.)
Cassandra Database Administration
Familiarity with SOA, SOLR, Splunk, Spark, Nagios, Docker
  
We are really looking for a Junior-Mid level Linux Systems Administrator that has had experience with GIT and Command Line.

  • They have to be flexible with their availability as there is an on-call requirement and a couple evenings a week that are needed.
  • Currently the on-call is 1 week out of 8 and it is typically very light and the evening work is on Tuesday/Thursday evenings.
  • If any on-call requirements happen he will work with the resource to provide comp time.
  • Currently the team consists of 6 Senior members and they are looking for more Junior level to take a lot of the day to day operational functions. There is potential this assignment will be extended past 2016 as well.

In connection with the Project, Consultant shall provide the Resource to perform the following Services:

  • Deploying, Supporting, and Configuring code in support of CloudTV, TVEverywhere, and IPVOD programs.
  • Build and rebuild of various operating environments in support for video delivery systems.
  • Delivering and assuring the performance of technically advanced; highly scalable, fault-tolerant systems.
  • Diagnose problems with both proprietary and open source application.
  • Working with virtual networks (VMware), Web Services, and streaming media
  • Application support, e.g. Unix and Web middleware development and/or QA
  • End to end systems administration of clusters of CentOS Linux dev/test/prod servers.
  • Troubleshoot infrastructure and network issues as possible causes of poor service performance.
  • Takes the lead on defining the requirements for new tools and monitors required for operations.
  • Custom development of monitoring, analytics and troubleshooting tools relating to video delivery systems.
  • Development of tools and processes for managing services running on Unix and Windows servers.
  • Participate in the development and testing of internal tools, scripts and other similar coding projects.
  • Respond to alerts, including those escalated to you by our Global Operations Center.
  • Monitor system health across our environment using OP5, Nagios, Splunk, SiteScope, etc. and assist NOC in diagnosing root cause.
  • Investigate potential problems (Splunk log diving, reproducing issues, bug writing and tracking) and simple network traces and troubleshooting.
  • Escalate issues to development team and working closely in a devOPS type environment.
  • Analyzes systems and makes recommendations to prevent possible problems.
  • Work with ticketing system and respond to system issues such as system exceptions, development and architectural questions.
  • Development and execution of various types of testing lot limited to regression, sit, validation, load, capacity, longevity and stress.

Additional Information

If you are interested, please contact:

Sophia

973-507-7563