We invest in people who change the way the world works.

Interested in working with them?
Tell us about your professional DNA and get discovered by the amazing companies in our network.

Staff Data Engineer



Software Engineering, Data Science
New York, NY, USA
Posted on Saturday, March 23, 2024
Join our team at ASAPP, where we're developing transformative Vertical AI designed to improve customer experience. Recognized by Forbes AI 50, ASAPP designs generative AI solutions that transform the customer engagement practices of Fortune 500 companies. With our automation and simplified work processes, we empower people to reach their full potential and create exceptional experiences for everyone involved. Work with our team of talented researchers, engineers, scientists, and specialists to help solve some of the biggest and most complex problems the world is facing.
The Data Engineering & Analytics team (DEA) at ASAPP powers the core of our data and analytics products. ASAPP's products are based on natural language processing and serve tens of millions of end-users in real time. We need sophisticated metrics to monitor and continuously improve our systems. We are seeking a Staff Data Engineer to serve as both a technical leader and a core individual contributor, by designing and building analytic data feeds for both our business partners and internal stakeholders.
Applicants with all or some relevant combination of the requirements listed below are encouraged to apply. This is a hybrid role, with a preference for candidates in proximity to either of our NYC or Mountain View offices

What you'll do

  • Lead the batch analytics team by providing the groundwork to modernize our data analytics architecture
  • Design and maintain our data warehouse to facilitate analysis across hundreds of systems events
  • Rethink and influence strategy and roadmap for building efficient data solutions and scalable data warehouses
  • Review code for style and correctness across the entire team
  • Write production-grade Redshift, Athena, Snowflake & Spark SQL queries
  • Manage and maintain Airflow ETL jobs
  • Test query logic against sample scenarios
  • Work across teams to gather requirements and understand reporting needs
  • Investigate metric discrepancies and data anomalies
  • Debug and optimize queries for other business units
  • Review schema changes across various engineering teams
  • Maintain high-quality documentation for our metrics and data feeds
  • Work with stakeholders in Data Infrastructure, Engineering, Product and Customer Strategy to assist with data-related technical issues and build scalable cross platform reporting framework
  • Participate in, and co-manage our on-call rotation to keep production pipelines up and running

What you'll need

  • 7+ years industry experience with clear examples of strategic technical problem solving and implementation
  • Expertise in at least one flavor of SQL. (We use Amazon Redshift, MySQL, Athena and Snowflake)
  • Strong experience with data warehousing (e.g. Snowflake (preferred), Redshift, BigQuery, or similar)
  • Experience with dimensional data modeling and schema design
  • Experience using developer-oriented data pipeline and workflow orchestration (e.g. Airflow (preferred), dbt, dagster or similar)
  • Experience with cloud computing services (AWS (preferred), GCP, Azure or similar)
  • Proficiency in a high-level programming language, especially in terms of reading and comprehending other developers’ code and intentions. (We use Python, Scala, and Go)
  • Deep technical knowledge of data exchange and serialization formats such as Protobuf, YAML, JSON, and XML
  • Familiarity with BI & Analytics tools (e.g. Looker, Tableau, Sisense, Sigma computing or similar)
  • Familiarity with streaming data technologies for low-latency data processing (e.g. Apache Spark/Flink, Apache Kafka, Snowpipe or similar)
  • Familiarity with Terraform, Kubernetes and Docker
  • Understanding of modern data storage formats and tools (e.g. parquet, Avro, Delta Lake)
  • Knowledge of modern data design and storage patterns (e.g. incremental updates, partitioning and segmentation, rebuilds and backfills)

What we'd like to see

  • Experience working at a startup preferred
  • Excellent communication skills - (Slack/Email/Documents)
  • Experienced with end user management & communication (cross team as well as external)
  • Must thrive in a fast paced environment and be able to work independently with urgency
  • Can work effectively remotely (able to be proactive about managing blockers, proactive on reaching out and asking questions, and participating in team activities)
  • Experienced in writing technical data design docs (pipeline design, dataflow, schema design)
  • Can scope and breakdown projects, communicate and collaborate progress and blockers effectively with your manager, team, and stakeholders
  • Good at task management & capacity tracking (JIRA (preferred))
ASAPP is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, disability, age, or veteran status. If you have a disability and need assistance with our employment application process, please email us at careers@asapp.com to obtain assistance. #LI-AG1 #LI-Remote