← back to jobs
> job detail
N
⚙️Data Engineer

Data Quality Analyst ( ETL/Data Warehouse ) (IN)

NetApp, Inc. · IN
// classified as
Data Engineer (Pipelines, infra, ingestion, ETL.)
posted
1d ago
location
IN
languages
python, shell, sql
tools
aws, azure, oracle
> stack
pythonshellsqlawsazureoraclesnowflaketableau
> description
<div><div style="padding:10.0px 0.0px;border:1.0px solid transparent"><div style="word-wrap:break-word"><H2 style="margin:0.0px">Job Summary</H2> </div><div><p>This role is responsible for ensuring the accuracy, reliability, and quality of enterprise ETL pipelines and AI-ready datasets. The individual will support analytics, business intelligence, and AI/ML initiatives by validating data pipelines, improving data quality frameworks, and enabling trustworthy model inputs.</p></div></div><div style="padding:10.0px 0.0px;border:1.0px solid transparent"><div style="word-wrap:break-word"><H2 style="margin:0.0px">Job Responsibilities</H2> </div><div><ul> <li>Lead ETL and data quality testing initiatives across multiple data platforms</li> <li>Validate data pipelines supporting AI/ML models and analytics dashboards</li> <li>Implement AI-assisted testing techniques (anomaly detection, rule discovery)</li> <li>Develop reusable SQL/Python-based frameworks for automated data validation</li> <li>Review training datasets for consistency, completeness, and data integrity</li> <li>Identify and resolve data quality issues across batch and incremental loads</li> <li>Partner with data engineers and AI teams to ensure reliability of model inputs</li> <li>Ensure adherence to data governance, QA standards, and best practices</li> <li>Support Agile delivery through CI/CD-aligned data testing processes</li> </ul></div></div><div style="padding:10.0px 0.0px;border:1.0px solid transparent"><div style="word-wrap:break-word"><H2 style="margin:0.0px">Job Requirement</H2> </div><div><p><strong>Experience:</strong> </p> <ul> <li>6–10 years in ETL/Data Warehouse testing and data quality engineering</li> <li>Strong SQL expertise (joins, aggregations, subqueries)</li> <li>Solid understanding of data warehousing concepts (fact/dimension models, star schema)</li> <li>Hands-on experience in ETL testing tools (Informatica, Talend, DataStage, ADF, etc.)</li> <li>Experience with databases such as Oracle, SQL Server, Snowflake, Teradata, PostgreSQL</li> <li>Exposure to AI/ML data pipelines and feature engineering validation</li> <li>Familiarity with data observability or AI-enabled QA tools</li> <li>Working knowledge of Agile and CI/CD practices</li> </ul> <p><strong>Good to Have:</strong></p> <ul> <li>Experience with cloud platforms (AWS, Azure, GCP, Snowflake)</li> <li>Basic Python or shell scripting for data validation</li> <li>Exposure to BI tools (Power BI, Tableau) validation</li> <li>Experience in Agile/Scrum environments</li> </ul> <p><strong>Education:</strong></p> <ul> <li>Bachelor’s degree in Computer Science, IT, Engineering, or equivalent</li> </ul></div></div></div>