> job detail
T
⚙️Data Engineer
Engineering-Applied Science/Machine Learning/Data Science
Tekion · India
// classified as
Data Engineer (Pipelines, infra, ingestion, ETL.)
posted
78d ago
location
India
languages
go, java, python
tools
aws, azure, docker
> stack
gojavapythonsqlawsazuredockerkafkakubernetess3sparkairflow
> education
msphd
> description
<div class="content-intro"><h2 style="text-align: justify;"><strong>About Tekion:</strong></h2>
<p style="text-align: justify;">Positively disrupting an industry that has not seen any innovation in over 50 years, Tekion has challenged the paradigm with the first and fastest cloud-native automotive platform that includes the revolutionary Automotive Retail Cloud (ARC) for retailers, Automotive Enterprise Cloud (AEC) for manufacturers and other large automotive enterprises and Automotive Partner Cloud (APC) for technology and industry partners. Tekion connects the entire spectrum of the automotive retail ecosystem through one seamless platform. The transformative platform uses cutting-edge technology, big data, machine learning, and AI to seamlessly bring together OEMs, retailers/dealers and consumers. With its highly configurable integration and greater customer engagement capabilities, Tekion is enabling the best automotive retail experiences ever. Tekion employs close to 3,000 people across North America, Asia and Europe.</p></div><p><span data-contrast="auto">We are seeking a highly accomplished leader in Applied AI and Machine Learning to drive Tekion’s end-to-end AI strategy, research innovation, and production-scale ML platform execution. This role combines deep scientific expertise with strong systems and platform engineering capabilities to translate advanced ML and LLM research into reliable, high-performance, enterprise-grade products.</span><span data-ccp-props="{"201341983":0,"335551550":6,"335551620":6,"335559685":142,"335559740":276}"> </span></p>
<p><span data-contrast="auto">The ideal candidate will shape technical vision, lead cross-functional execution, productionize ML systems at scale, and establish best-in-class practices across the full machine learning lifecycle.</span><span data-ccp-props="{"201341983":0,"335551550":6,"335551620":6,"335559685":142,"335559740":276}"> </span></p>
<p><strong><span data-contrast="auto"><span data-ccp-parastyle="heading 1">Key</span><span data-ccp-parastyle="heading 1"> </span><span data-ccp-parastyle="heading 1">Responsibilities</span></span></strong><span data-ccp-props="{"201341983":0,"335559685":0,"335559740":276}"> </span></p>
<p><strong><span data-contrast="auto">Strategic Leadership & Innovation</span></strong><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Architect and execute Tekion’s strategic vision for Applied AI and Machine Learning, ensuring strong alignment with business objectives and industry needs.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Drive the R&D roadmap by identifying emerging technological opportunities and delivering scientifically grounded innovations.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Serve as the primary technical liaison between the R&D organization and executive leadership.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Contribute to the broader scientific community through publications and participation in leading academic conferences and journals.</span></li>
</ul>
<p><strong><span data-contrast="auto">Cross-Functional Delivery</span></strong><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Partner closely with Product, Engineering, Data, and Business teams to design and integrate advanced ML capabilities into core products and services.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Translate applied science prototypes (tabular ML, NLP/LLMs, recommendation systems, forecasting) into scalable production services.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Review, refactor, and optimize data science models for production readiness.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Mentor applied scientists and engineers, fostering a culture of technical excellence and innovation.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
</ul>
<p><strong><span data-contrast="auto">ML Platform & Production Engineering</span></strong><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Build and operate robust CI/CD pipelines for machine learning systems.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Develop high-performance inference microservices (REST/gRPC) with schema versioning, structured outputs, and strict p95 latency targets.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Integrate with the LLM Gateway/MCP, including prompt and configuration versioning.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Design and implement batch and streaming data pipelines using technologies such as Airflow/Kubeflow, Spark/Flink, and Kafka.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Collaborate on enterprise system architecture with data engineers, platform teams, and architects</span><strong><span data-contrast="auto">.</span></strong><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
</ul>
<p><strong><span data-contrast="auto">LLM & Agentic Systems Excellence</span></strong><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Implement advanced prompt management frameworks, including versioning, A/B testing, guardrails, and dynamic orchestration.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Monitor, detect, and mitigate risks unique to LLMs and agent-based systems.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Establish best practices for safe, reliable, and cost-efficient LLM deployment at scale.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
</ul>
<p><strong><span data-contrast="auto">Lifecycle Management, Observability & Reliability</span></strong><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Own the end-to-end model and feature lifecycle, including feature store strategy, model/agent registry, versioning, and lineage.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Build deep observability across traces, logs, metrics, drift detection, model performance, safety signals, and cost tracking.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Ensure real-time service reliability through autoscaling, caching, circuit breakers, retries/fallbacks, and graceful degradation.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Establish robust model evaluation frameworks and clearly quantify business impact for executive stakeholders.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Define and govern best practices across the full ML lifecycle while championing ethical and responsible AI</span><strong><span data-contrast="auto">.</span></strong><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
</ul>
<p><strong><span data-contrast="auto">Developer Experience & Enablement</span></strong><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Create reusable templates, SDKs, CLIs, sandbox datasets, and documentation that make ML delivery fast, reliable, and repeatable across teams.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Drive platform standardization to make shipping ML the default path within the organization</span><strong><span data-contrast="auto">.</span></strong><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
</ul>
<p><span data-ccp-props="{"201341983":0,"335559685":720,"335559731":0,"335559738":91,"335559740":276,"335559991":361}"> </span><strong><span data-contrast="auto">Core Competencies & Technical Expertise</span></strong> <br><span data-contrast="auto">T</span><span data-contrast="auto">he successful candidate will demonstrate mastery in the following areas:</span><span data-ccp-props="{"201341983":0,"335559685":473,"335559738":91,"335559740":276,"335559991":361}"> </span></p>
<p><strong><span data-contrast="auto">Foundational Expertise</span></strong><span data-contrast="auto">: Deep, theoretical and practical expertise in Machine Learning, Deep Learning, Causal Inference, and Explainable AI.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></p>
<p><strong><span data-contrast="auto">Statistical Rigor</span></strong><span data-contrast="auto">: Advanced proficiency in applied probability and statistics to derive and validate insights from complex, high-dimensional data.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></p>
<p><strong><span data-contrast="auto">Deep Learning</span></strong><span data-contrast="auto">:</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Expert-level proficiency with frameworks such as TensorFlow, Keras, and PyTorch.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Extensive experience implementing advanced neural network architectures.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Practical application of Computer Vision (e.g., OpenCV) and Natural Language Processing (e.g., spaCy) methodologies.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
</ul>
<p><strong><span data-contrast="auto">Large Language Models (LLMs)</span></strong><span data-contrast="auto">: Demonstrated experience with Large Language Models, including advanced prompt engineering, fine-tuning, and deployment for specific business applications.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></p>
<p><strong><span data-contrast="auto">Technical Proficiencies</span></strong><span data-contrast="auto">:</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Advanced programming skills in Python and mastery of SQL. Familiarity with distributed computing frameworks (e.g., Spark) is advantageous.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Proficiency with cloud computing platforms (GCP, Azure, AWS).</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Expertise in experimental design (A/B testing, causal inference).</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Proficient in version control systems (Git).</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
</ul>
<p><strong><span data-contrast="auto"> </span></strong> <strong><span data-contrast="auto">Basic & Preferred Qualifications</span></strong><span data-ccp-props="{"201341983":0,"335559685":360,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Advanced degree (M.S. or Ph.D. preferred) in Computer Science, Statistics, Operations Research, Physics, or a related quantitative discipline.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">6+ years of post-academic experience in applied science, machine learning, or quantitative research roles, with a strong track record of translating complex models into measurable business impact.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Demonstrated success solving difficult, business-critical problems using rigorous, data-driven methodologies.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Proven hands-on experience in programming, large-scale data manipulation, and building production-grade models in real-world business environments.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Strong data visualization and executive communication skills, with the ability to translate complex analytical findings into clear, actionable insights for diverse stakeholders.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
</ul>
<p><strong><span data-contrast="auto">LLM & Advanced AI Systems</span></strong><span data-ccp-props="{"201341983":0,"335559685":360,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Practical experience with LLMs, retrieval systems, vector databases, and graph/knowledge stores.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Hands-on experience with orchestration frameworks such as LangChain, LlamaIndex, OpenAI function calling, AgentKit, or similar ecosystems.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Solid understanding of modern agent architectures (reactive, planning, and retrieval-augmented agents) and safe execution patterns.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
</ul>
<p><strong><span data-contrast="auto">Software Engineering & Distributed Systems</span></strong><span data-ccp-props="{"201341983":0,"335559685":360,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Strong software engineering fundamentals, including Python and at least one of Java, Go, or Scala.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Experience with API design, concurrency, testing strategies, and production code quality standards.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Proven experience building and operating microservices using REST/gRPC.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Hands-on experience with Docker, Kubernetes, and service mesh environments.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Strong performance and reliability engineering mindset.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
</ul>
<p><strong><span data-contrast="auto">Data & Pipeline Engineering</span></strong><span data-ccp-props="{"201341983":0,"335559685":360,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Experience designing and operating batch and streaming pipelines using Airflow, Kubeflow, or similar orchestration tools.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Working knowledge of Spark or Flink for distributed data processing.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Experience with streaming platforms such as Kafka or Kinesis.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Strong grounding in data quality, validation, and governance practices.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
</ul>
<p><strong><span data-contrast="auto">MLOps, Observability & Reliability</span></strong><span data-ccp-props="{"201341983":0,"335559685":360,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Experience with experiment tracking and model registries (e.g., MLflow), feature stores, A/B testing, shadow deployments, and drift detection.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Deep observability experience using tools such as OpenTelemetry, Prometheus, and Grafana.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Strong debugging skills for latency, tail performance, and memory/CPU bottlenecks.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
</ul>
<p><strong><span data-contrast="auto">Cloud, Security & Compliance</span></strong><span data-ccp-props="{"201341983":0,"335559685":360,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Strong cloud experience, preferably AWS (IAM, ECS/EKS, S3, RDS/DynamoDB, Step Functions, Lambda), including cost optimization practices.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Experience with secrets management, RBAC/ABAC, PII handling, and auditability requirements in production systems.</span><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></li>
</ul>
<p><span data-ccp-props="{"201341983":0,"335559685":360,"335559740":276}"> </span><strong><span data-contrast="auto">Ideal Candidate Profile</span></strong><span data-ccp-props="{"201341983":0,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">The ideal candidate is a technically exceptional Applied AI leader who combines deep scientific rigor with strong production engineering discipline. They have a proven ability to translate advanced machine learning and LLM research into scalable, reliable, and business-impacting systems.</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">This individual operates comfortably across the full spectrum—from research ideation and model development to platform architecture, production deployment, and real-time reliability. They bring strong ownership, systems thinking, and the ability to influence both technical teams and executive stakeholders</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
</ul>
<p><span data-ccp-props="{"201341983":0,"335559740":276}"> </span><strong><span data-contrast="auto"><span data-ccp-parastyle="heading 1">Perks and Benefits</span></span></strong><span data-ccp-props="{"201341983":0,"335559685":113,"335559740":276}"> </span></p>
<ul>
<li><span data-contrast="auto">Competitive compensation</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Generous stock options</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Medical Insurance coverage</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
<li><span data-contrast="auto">Work with some of the brightest minds from Silicon Valley’s most dominant and successful companies</span><span data-ccp-props="{"201341983":0,"335559738":91,"335559740":276}"> </span></li>
</ul><div class="content-conclusion"><p> </p>
<hr>
<p><span data-contrast="auto">Tekion is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, victim of violence or having a family member who is a victim of violence, the intersectionality of two or more protected categories, or other applicable legally protected characteristics.</span><span data-ccp-props="{}"> </span></p>
<p><span style="font-size: 14px;" data-contrast="auto">For more information on our privacy practices, please refer to our Applicant Privacy Notice </span><a style="font-size: 14px;" href="https://tekion.com/legal/privacy/applicant-and-candidate-privacy-notice"><span data-contrast="none"><span data-ccp-charstyle="Hyperlink">h</span><span data-ccp-charstyle="Hyperlink">e</span><span data-ccp-charstyle="Hyperlink">re</span></span></a>.</p></div>