← back to jobs
> job detail
D
⚙️Data Engineer

Big Data Software Engineer

DoubleVerify · Tel Aviv-Israel
// classified as
Data Engineer (Pipelines, infra, ingestion, ETL.)
posted
43d ago
location
Tel Aviv-Israel
languages
java, kotlin, scala
tools
bigquery, clickhouse, databricks
> stack
javakotlinscalabigqueryclickhousedatabricksdeltadockerkafkamongodbnosqlsnowflakespark
> education
ms
> description
<p><strong>Who we are</strong></p> <p>DoubleVerify is an Israeli-founded big data analytics company (Stock:<a href="https://www.google.com/finance/quote/DV:NYSE" target="_blank"> NYSE: DV</a>). We track and analyze tens of billions of ads every day for the biggest brands in the world.<br>We operate at a massive scale, handling over 100B events per day and over 1M RPS at peak, we process events in real-time at low latencies (ms) and analyze over 2.5M video years <strong>every day</strong>. We verify that all ads are fraud free, appear next to appropriate content, appear to people in the right geography and measure the viewability and user’s engagement throughout the ad’s lifecycle.&nbsp;</p> <p>We are global, with HQ in NYC and R&amp;D centers in Tel Aviv, New York, Finland, Berlin, Belgium and San Diego. We work in a fast-paced environment and have a lot of challenges to solve. If you like to work in a huge scale environment and want to help us build products that have a huge impact on the industry, and the web - then your place is with us.</p> <h4>&nbsp;</h4> <h4><strong>What you'll do</strong></h4> <p>You will join the Traffic Team, a core engineering team operating at the heart of the company's measurement system.</p> <p>You will Build and maintain high-throughput streaming systems processing 100B+ daily events.</p> <p>Tackle performance and optimization challenges that make interview questions actually relevant</p> <p>Design and implement real-time data processing pipelines using Kafka, Databricks/Spark, and distributed computing</p> <p>Lead projects end-to-end: design, development, integration, deployment, and production support</p> <p>&nbsp;</p> <h4><strong>Who you are</strong></h4> <ul> <li>5+ years of software development experience with JVM-based languages (Scala, Java, Kotlin) with strong functional programming skills</li> <li>Strong grasp of Computer Science fundamentals: functional programming paradigms, object-oriented design, data structures, concurrent/distributed systems</li> <li>Proven experience with high-scale, real-time streaming systems and big data processing.</li> <li>Experience and deep understanding of a wide array of technologies, including:</li> <li>Stream processing: Kafka, Kafka Streams, or similar frameworks (Flink, Spark Streaming, Pulsar).</li> <li>Concurrency frameworks: Akka, Pekko, or equivalent actor systems/reactive programming.</li> <li>Data platforms: Databricks, Spark, Delta Lake, or similar lakehouse technologies.</li> <li>Microservices &amp; containerization: Docker, Kubernetes.</li> <li>Modern databases: Experience across analytical databases (ClickHouse, Snowflake, BigQuery), NoSQL (Cassandra, MongoDB), and columnar stores</li> <li>Cloud infrastructure: GCP or AWS.</li> <li>Hands-on experience developing with AI tools (Cursor, Claude Code, etc..) .</li> <li>Strong DevOps mindset: CI/CD pipelines (GitLab preferred), infrastructure as code, monitoring/alerting.</li> <li>BSc in Computer Science or equivalent experience.</li> <li>Excellent communication skills and ability to collaborate across teams.</li> </ul> <p>&nbsp;</p> <h4><strong>Nice to have</strong></h4> <ul> <li>Previous experience in ad-tech.</li> <li>Experience with schema evolution and data serialization (Avro, Protobuf, Parquet)</li> </ul> <p>#Hybrid#</p>