> job detail
⚙️Data Engineer
Big Data Software Engineer
DoubleVerify · Tel Aviv-Israel
// classified as
Data Engineer (Pipelines, infra, ingestion, ETL.)
posted
43d ago
location
Tel Aviv-Israel
languages
java, kotlin, scala
tools
bigquery, clickhouse, databricks
> stack
java, kotlin, scala, bigquery, clickhouse, databricks, delta, docker, kafka, mongodb, nosql, snowflake, spark
> education
ms
> description
<p><strong>Who we are</strong></p>
<p>DoubleVerify is an Israeli-founded big data analytics company (Stock: <a href="https://www.google.com/finance/quote/DV:NYSE" target="_blank">NYSE: DV</a>). We track and analyze tens of billions of ads every day for the biggest brands in the world.<br>We operate at massive scale, handling over 100B events per day and over 1M RPS at peak. We process events in real time at low latencies (ms) and analyze over 2.5M video years <strong>every day</strong>. We verify that all ads are fraud-free, appear next to appropriate content, and reach people in the right geography, and we measure viewability and user engagement throughout the ad’s lifecycle.</p>
<p>We are global, with HQ in NYC and R&D centers in Tel Aviv, New York, Finland, Berlin, Belgium and San Diego. We work in a fast-paced environment and have a lot of challenges to solve. If you like working in a huge-scale environment and want to help us build products that have a huge impact on the industry and the web, your place is with us.</p>
<h4><strong>What you'll do</strong></h4>
<p>You will join the Traffic Team, a core engineering team operating at the heart of the company's measurement system.</p>
<p>You will build and maintain high-throughput streaming systems processing 100B+ daily events.</p>
<p>Tackle performance and optimization challenges that make interview questions actually relevant.</p>
<p>Design and implement real-time data processing pipelines using Kafka, Databricks/Spark, and distributed computing.</p>
<p>Lead projects end-to-end: design, development, integration, deployment, and production support.</p>
<p> </p>
<h4><strong>Who you are</strong></h4>
<ul>
<li>5+ years of software development experience with JVM-based languages (Scala, Java, Kotlin), with strong functional programming skills.</li>
<li>Strong grasp of Computer Science fundamentals: functional programming paradigms, object-oriented design, data structures, concurrent/distributed systems.</li>
<li>Proven experience with high-scale, real-time streaming systems and big data processing.</li>
<li>Experience and deep understanding of a wide array of technologies, including:</li>
<li>Stream processing: Kafka, Kafka Streams, or similar frameworks (Flink, Spark Streaming, Pulsar).</li>
<li>Concurrency frameworks: Akka, Pekko, or equivalent actor systems/reactive programming.</li>
<li>Data platforms: Databricks, Spark, Delta Lake, or similar lakehouse technologies.</li>
<li>Microservices & containerization: Docker, Kubernetes.</li>
<li>Modern databases: Experience across analytical databases (ClickHouse, Snowflake, BigQuery), NoSQL (Cassandra, MongoDB), and columnar stores.</li>
<li>Cloud infrastructure: GCP or AWS.</li>
<li>Hands-on experience developing with AI tools (Cursor, Claude Code, etc.).</li>
<li>Strong DevOps mindset: CI/CD pipelines (GitLab preferred), infrastructure as code, monitoring/alerting.</li>
<li>BSc in Computer Science or equivalent experience.</li>
<li>Excellent communication skills and ability to collaborate across teams.</li>
</ul>
<p> </p>
<h4><strong>Nice to have</strong></h4>
<ul>
<li>Previous experience in ad-tech.</li>
<li>Experience with schema evolution and data serialization (Avro, Protobuf, Parquet).</li>
</ul>
<p>#Hybrid#</p>