BitMEX is the world’s leading cryptocurrency derivatives trading platform, which has pioneered cryptocurrency trading through relentless commitment to change, and continues to set benchmarks for innovation, liquidity, and security today.
As the world's most advanced peer-to-peer crypto-products trading platform and API, BitMEX gives knowledge, confidence, and precision to hundreds of thousands of traders, transacting billions of USD a day.
Join us, as we build a thriving cryptocurrency ecosystem through strategic investments in emerging cryptocurrency technology, and create the future of digital financial services.
As a member of the Risk Management team you will assist in providing services supporting the management of the group's financial risk exposure.
This role will be responsible for managing and supporting the infrastructure which drives our industry-leading electronic risk management platform. The reliability and scalability of our technology is key to our success and this position will work with our core development teams to help design highly available and fault tolerant systems.
Manage and support our AWS cloud infrastructure, EC2 instances and Kubernetes clusters
Support development teams to adopt best practices for running their applications in the cloud
Improve platform automation, testing, monitoring, telemetry, and alerting
Collaborate with our security team to ensure vulnerabilities are patched and audit recommendations are addressed in a timely manner
Develop disaster recovery capabilities to ensure our business can continue to operate in the event of a technology failure
Minimum 6 years DevOps experience supporting and managing Linux based infrastructure
Computer science or engineering degree preferred - strong understanding of fundamental Computer Science principles is required
Comfortable with at least one programming language such as Java or Python.
Able to work under minimal supervision to identify problem areas requiring improvement and seeing a plan through to implementation
Be detail-orientated with experience of end to end troubleshooting and working closely with development teams to solve problems
Strong AWS knowledge and experience or related cloud providers
Knowledge of Docker and multi-stage builds, registries, and best image practices
Deep understanding of managing and supporting Kubernetes clusters. Knowledge of;
microservices architectures and internal components
pod liveness monitoring and metrics practices
running applications in high availability mode (HA) on Kubernetes
network architecture how traffic moves within pods, between clusters, load balancers, and the internet
Be proficient with terraform, helm chart deployments and vault
Familiarity with CI/CD tools such as GitLab
Experience supporting distributed systems communicating over a messaging bus (such as Kafka, NATS, Tibco or LBM)
Familiar with instrumentation solutions such as Prometheus, Telegraf, InfluxDB, Grafana and Lightstep.
Experience working at an Investment banking or finance related field (e.g. Equities, FX, HFT, Electronic Trading)
Experience with AWS EKS
Experience with supporting Q/KDB+ is a big plus.
Knowledge of continuous automation software such as Chef or Puppet