Portfolio Careers

Senior Data Engineer

Roboto AI

Roboto AI

Data Science
Posted on Wednesday, April 12, 2023
Senior Data Engineer
At KeyCaliber, we are an early stage company doing big things to advance cybersecurity programs in meaningful ways. You will work side by side with a team of industry veterans who possess decades of experience defending networks and quantifying cyber risk. Using cutting edge technology, data visualization, and advanced machine learning, we are empowering organizations to prioritize their most precious cybersecurity resources. We are here to make the fight not just manageable but winnable and we need someone like you to be a part of our mission!
We are looking for a Senior Data Engineer who has experience building data pipelines that ingest, normalize, and analyze data to power a machine learning system and deliver strategic insights to end users. The ideal candidate is one who is self-driven, excited to work with new technologies, and can help us make rapid progress on a variety of big data projects. While your core responsibilities will be in further building out our data pipeline and leading our data strategy, you will also get the chance to develop your API development and SecDevOps skills and work some of the most interesting data sets in cybersecurity. This is a great role to make a big impact on a product that is already helping organizations address some of their hardest cybersecurity challenges.
Requirements and qualifications
We realize that there is rarely a “perfect candidate” for any role and value finding the right person with enough aptitude to hit the ground running.
While these are some of the qualifications that we feel are essential for success in this role, if you think you would be a great fit and are a little unsure about any particular area, let’s talk!

  • Strong development skills in a data engineering programming language, ideally Python (5+ years)
  • 5+ years experience working with SQL and NoSQL databases, spotting inefficient queries and tuning databases for optimal performance
  • 3+ years experience working with Kafka or another message-based architecture
  • 3+ years experience building ETL pipelines that have processed some heavy workloads
  • 2+ years developing and deploying code in a cloud-based environment of at least one major cloud provider
  • 2+ years developing REST APIs
  • You are cost-conscious and resourceful favoring a good, cost-effective solution that gets the job done
  • You possess a willingness to learn new skills and take on additional responsibilities as they arise
  • You have strong communication skills and are proactive in your updates to the team

What you will do

  • Own the development and rollout of a series of data pipelines to source data from various cyber security controls and network management systems systems and create a unified data model for analytics and reporting
  • Contribute to the design and implementation of multiple product capabilities with a focus on executive level reporting and business insight generation
  • Work with your team and external customers to gather functional and technical specifications
  • Test different setups, configurations, and services with a focus on scalability
  • Incorporate solid coding practices (e.g. peer code reviews, unit testing) and go out of your way to reduce complexity
  • Document architecture, runbooks, and configurations enough so that you could take a vacation and someone else could cover for you
  • Maintain an independent work schedule while collaborating regularly with the team and reporting on progress, blockers, and future concerns
  • Utilize agile development processes and adjust to changes in projects and priorities as necessary as we keep up with the pace of an early-stage startup

We would also be thrilled if you have meet any of the following qualifications but they are definitely not required:

  • Experience developing with Docker and/or Kubernetes
  • Experience working with a graph database like neo4j
  • 2+ years using CI/CD to build, test, and deploy application code
  • Familiarity with cybersecurity concepts and terminology