> job detail
N
⚙️Data Engineer
Data Quality Analyst ( ETL/Data Warehouse ) (IN)
NetApp, Inc. · IN
// classified as
Data Engineer (Pipelines, infra, ingestion, ETL.)
posted
1d ago
location
IN
languages
python, shell, sql
tools
aws, azure, oracle
> stack
pythonshellsqlawsazureoraclesnowflaketableau
> description
<div><div style="padding:10.0px 0.0px;border:1.0px solid transparent"><div style="word-wrap:break-word"><H2 style="margin:0.0px">Job Summary</H2>
</div><div><p>This role is responsible for ensuring the accuracy, reliability, and quality of enterprise ETL pipelines and AI-ready datasets. The individual will support analytics, business intelligence, and AI/ML initiatives by validating data pipelines, improving data quality frameworks, and enabling trustworthy model inputs.</p></div></div><div style="padding:10.0px 0.0px;border:1.0px solid transparent"><div style="word-wrap:break-word"><H2 style="margin:0.0px">Job Responsibilities</H2>
</div><div><ul>
<li>Lead ETL and data quality testing initiatives across multiple data platforms</li>
<li>Validate data pipelines supporting AI/ML models and analytics dashboards</li>
<li>Implement AI-assisted testing techniques (anomaly detection, rule discovery)</li>
<li>Develop reusable SQL/Python-based frameworks for automated data validation</li>
<li>Review training datasets for consistency, completeness, and data integrity</li>
<li>Identify and resolve data quality issues across batch and incremental loads</li>
<li>Partner with data engineers and AI teams to ensure reliability of model inputs</li>
<li>Ensure adherence to data governance, QA standards, and best practices</li>
<li>Support Agile delivery through CI/CD-aligned data testing processes</li>
</ul></div></div><div style="padding:10.0px 0.0px;border:1.0px solid transparent"><div style="word-wrap:break-word"><H2 style="margin:0.0px">Job Requirement</H2>
</div><div><p><strong>Experience:</strong> </p>
<ul>
<li>6–10 years in ETL/Data Warehouse testing and data quality engineering</li>
<li>Strong SQL expertise (joins, aggregations, subqueries)</li>
<li>Solid understanding of data warehousing concepts (fact/dimension models, star schema)</li>
<li>Hands-on experience in ETL testing tools (Informatica, Talend, DataStage, ADF, etc.)</li>
<li>Experience with databases such as Oracle, SQL Server, Snowflake, Teradata, PostgreSQL</li>
<li>Exposure to AI/ML data pipelines and feature engineering validation</li>
<li>Familiarity with data observability or AI-enabled QA tools</li>
<li>Working knowledge of Agile and CI/CD practices</li>
</ul>
<p><strong>Good to Have:</strong></p>
<ul>
<li>Experience with cloud platforms (AWS, Azure, GCP, Snowflake)</li>
<li>Basic Python or shell scripting for data validation</li>
<li>Exposure to BI tools (Power BI, Tableau) validation</li>
<li>Experience in Agile/Scrum environments</li>
</ul>
<p><strong>Education:</strong></p>
<ul>
<li>Bachelor’s degree in Computer Science, IT, Engineering, or equivalent</li>
</ul></div></div></div>