WEKA Logo

WEKA

WEKA Platform Administrator

Reposted 19 Days Ago
Be an Early Applicant
In-Office
7 Locations
Mid level
In-Office
7 Locations
Mid level
As a WEKA Platform Administrator, you will manage and optimize storage systems, engage with customers, and ensure performance in high-performance computing environments.
The summary above was generated by AI

WEKA is architecting a new approach to the enterprise data stack built for the age of reasoning. NeuralMesh by WEKA sets the standard for agentic AI data infrastructure with a cloud and AI-native software solution that can be deployed anywhere. It transforms legacy data silos into data pipelines that dramatically increase GPU utilization and make AI model training and inference, machine learning, and other compute-intensive workloads run faster, work more efficiently, and consume less energy.
WEKA is a pre-IPO, growth-stage company on a hyper-growth trajectory. We’ve raised $375M in capital with dozens of world-class venture capital and strategic investors. We help the world’s largest and most innovative enterprises and research organizations, including 12 of the Fortune 50, achieve discoveries, insights, and business outcomes faster and more sustainably. We’re passionate about solving our customers’ most complex data challenges to accelerate intelligent innovation and business value. If you share our passion, we invite you to join us on this exciting journey.

What you'll be doing:

As a WEKA Platform Administration, you'll be at the forefront of this revolution. You'll work within our Customer Success team maintaining and optimizing our platform for unparalleled performance and helping our Premium Support customers achieve breakthroughs in their fields.

Responsibilities:

  • Storage management and optimization, including upgrades, audits, performance monitoring, file system creation, NFS / SMB / S3 configuration as needed, etc
  • Documentation: create and maintain architecture, operational and workflow documentation of the WEKA environment according to best practices
  • Service management: interact and integrate with operational service management teams (internally and externally) to ensure smooth, continuous storage operations
  • Customer management: serve as a trusted adviser to the customer, providing technical mentorship on WEKA solutions
  • Partnering with 3rd party partners to implement solutions based on customer’s requirements and designs
  • Effectively communicating with internal customers, peers, and managers regarding storage administration issues.
  • Continued learning on emerging technologies in HPC, Kubernetes, Cloud and Storage

Requirements:

  • 3+ years of HPC (high-performance compute) experience with a strong focus on distributed systems, parallel file systems, or high-performance networking.
  • Proficiency in Linux environments including command-level programming.
  • Demonstrated experience with architecting,  performance optimization and troubleshooting in complex, distributed systems.
  • Familiarity with HPC interconnects (e.g., InfiniBand, RoCE) and protocols.
  • Understanding of storage technologies (NVMe, Object, Block, File) 
  • Excellent problem-solving skills and a passion for tackling challenging technical issues.
  • Experience deploying large Cloud deployments and working knowledge of at least one of the major Clouds - AWS, Oracle Cloud, Google Cloud, or Microsoft Azure.
  • Experience with ticketing system tools (Jira, etc) and competencies in Major Incident Management
  • Experience implementing system monitoring/alerting tools for Enterprise IT infrastructures 
  • Able to proficiently coordinate and troubleshoot problems in partnership with internal teams and external 3rd party vendors
  • Experience evaluating & managing outages to business-critical infrastructure
  • Able to meet service-level agreement (SLA) deadlines and work within a fast-paced environment

Bonus Points For:

  • Kubernetes proficiency
  • Experience with cloud-native HPC deployments (AWS, Azure, GCP).
  • Knowledge of AI/ML frameworks and their data access patterns.
  • Contributions to open-source HPC projects.
  • Experience with kernel bypass technologies (e.g., SPDK, DPDK).

How We Work: The WEKA Way

  • We are Accountable: We take full ownership, always–even when things don’t go as planned. We lead with integrity, show up with responsibility & ownership, and hold ourselves and each other to the highest standards.
  • We are Brave: We question the status quo, push boundaries, and take smart risks when needed. We welcome challenges and embrace debates as opportunities for growth, turning courage into fuel for innovation. 
  • We are Collaborative: True collaboration isn’t only about working together. It’s about lifting one another up to succeed collectively. We are team-oriented and we communicate with empathy and respect. We challenge each other and conduct positive conflict resolution. We are being transparent about our goals and the results we achieve. Together, we’re unstoppable. 
  • We are Customer Centric: Our customers are at the heart of everything we do. We actively listen and prioritize the success of our customers, and every decision we make is driven by how we can better serve, support, and empower them to succeed. When our customers win, we win.

Concerned you don’t meet every qualification? Don’t let it stop you from applying!
Studies have shown that traditionally underrepresented groups may be less likely to apply for jobs if they don’t meet every qualification specified. WEKA is committed to building a diverse, inclusive, and authentic workplace. If you are excited about this position but are concerned your past work experience doesn’t match up perfectly with the job description, we encourage you to apply anyway – you may be just the right candidate for this or other roles at WEKA.

WEKA is an equal opportunity employer that prohibits discrimination and harassment of any kind. We provide equal opportunities to all employees and applicants for employment without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.

The base salary hiring wage range for this position which the Company reasonably and in good faith expects to pay for the position in the specified geographic areas or locations, is $140,000 - $185,000. Final compensation will be dependent on various factors relevant to the position and candidate such as geographical location, candidate qualifications, certifications, relevant job-related work experience, education, skillset and other relevant business and organizational factors, consistent with applicable law. In addition, the position may include some of the following comprehensive benefits such as Medical, Dental, Vision, Life, 401(K), Flexible Time off (FTO), sick time, leave of absence as per the FMLA and other relevant leave laws.



Top Skills

AWS
Block Storage
File Storage
GCP
Hpc
Infiniband
JIRA
Kubernetes
Linux
Azure
Nvme
Object Storage
Oracle Cloud
Roce

Similar Jobs

An Hour Ago
In-Office
Toronto, ON, CAN
Senior level
Senior level
Artificial Intelligence • Fintech • Payments • Financial Services • Generative AI
Lead the engineering team to build stablecoin infrastructure, integrating blockchain and financial systems while ensuring secure and scalable architecture.
Top Skills: BlockchainC#C++Fx SystemsJavaKotlinPythonRust
An Hour Ago
In-Office or Remote
7 Locations
Senior level
Senior level
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
Lead the development of advanced machine learning models and AI solutions for customer support, focusing on chatbot and voice automation technologies.
Top Skills: JaxPythonPyTorchTensorFlow
An Hour Ago
In-Office or Remote
7 Locations
Expert/Leader
Expert/Leader
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
The Staff Software Engineer will design, build, and maintain Square's catalog system, ensuring reliability and scalability while mentoring other engineers.
Top Skills: AWSDynamoDBEnvoyGoGrpcKafkaKubernetesMySQLOpensearchProtocol BuffersRedisTemporal IoTerraform

What you need to know about the Vancouver Tech Scene

Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account