Principal GPFS Engineer

  • San Francisco (City and County of San Francisco)
  • IT development

Job description



·  Job facts

As a core member of the Roche Science Infrastructure (RSI) team, the GPFS engineer will be responsible for developing and maintaining all aspects of the global performance data tier in RSI. Working closely with other members of the RSI team and with the Enterprise Operations and Engineering groups, the successful candidate will rely on their experience, knowledge and expertise to support, enhance and optimize the global GPFS solution deployed across multiple geographic regions. Duties include testing, tuning and updating all GPFS installations, keeping current with releases, and maintaining configurations and data management for performance and stability using modern tools and toolchains. The successful candidate will also be responsible for providing expertise in the development of the overall data lifecycle management strategy and for integrating new capabilities into GPFS. Recognized as an expert in the field, they will provide technical consultancy to other members of Infrastructure Services and will have demonstrated complex problem-solving abilities. Some experience in mentoring and leading others in small team environments is highly desirable. The position is global and may be placed in one of several geographic locations.

Main responsibilities are:

· Design (architect), implement and troubleshoot large-scale (petabytes) storage systems. This includes developing technical drawings that show all required cables and connectivity to existing systems, and communicating with key stakeholders.

· Serve as a GPFS subject matter expert for the RSI team, as well as for other colleagues both within and outside of RSI and the broader IS organization.

· Own the ongoing integration of GPFS into the broader HPC environment, including its policies and ease of use.

· Develop and execute test plans for filesystem upgrades and issue resolution, including working with vendors.

· Resolve user-reported application issues (e.g., filesystem, RDMA interconnect, performance degradation, protocol issues).

· Maintain a current development/test environment for GPFS enhancements, both on-premises and in the cloud.

· Work closely with other members of the RSI team to ensure that GPFS is optimally used across the services offered by RSI (network, I&AM, tiered storage, containers, public cloud, etc.).

Qualifications:

The Principal GPFS Engineer will be an experienced IT professional with a Bachelor's degree (advanced degree preferred) in a relevant field of technology, science or business, and the following qualifications:

· 5-10 years of experience as a high-performance computing (HPC) parallel filesystem storage administrator, working with IBM Spectrum Scale (GPFS), Lustre, or equivalent, including optimizing for performance, reliability, and security.

· In-depth knowledge of HPC parallel filesystems and the ability to troubleshoot complex problems. Must be comfortable with monitoring and managing clustered filesystems, and be able to examine GPL driver code when required.

· Experience with deploying parallel filesystem upgrades in a rolling fashion with no overall system downtime.

· Experience with GPFS Cluster Export Services, Clustered NFS, and GPFS Multi-cluster.

· In-depth knowledge of Linux NFS server/client implementation and the ability to troubleshoot NFS issues.

· In-depth knowledge of SAN technologies (e.g., FC, FCoE, RoCE, NVMe-oF, iSER, SRP) and awareness of high-level protocol function, management approaches, and performance tuning.

· Deep experience with InfiniBand or Omni-Path high-speed fabrics, including subnet management, IPoIB and/or IPoOPA mechanisms, fabric topology and health monitoring, and integration with MPI.

· Knowledge of Ethernet networking (VLANs, etc.).

· Knowledge of configuration management tools and toolchains (e.g., Ansible, Jenkins).

· Working knowledge of scripting and programming languages such as C, C++, Fortran, Bash, CSH, TCSH, Perl, Python, and Ruby.

· Good organizational skills to balance and prioritize work, and the ability to multitask.

· Good communication skills for working with support personnel, customers, and managers.

·  Who we are

A member of the Roche Group, Genentech has been at the forefront of the biotechnology industry for more than 40 years, using human genetic information to develop novel medicines for serious and life-threatening diseases. Genentech has multiple therapies on the market for cancer and other serious illnesses. Please take this opportunity to learn about Genentech, where we believe that our employees are our most important asset and are dedicated to remaining a great place to work.

Roche is an equal opportunity employer and strictly prohibits unlawful discrimination based upon an individual's race, color, religion, gender, sexual orientation, gender identity/expression, national origin/ancestry, age, mental/physical disability, medical condition, marital status, veteran status, or any other characteristic protected by law.
