> job detail
👽Other
Senior Data Engineer, Mandarin
A5 Labs · San Jose, United States of America
// classified as
Other (Adjacent or hard to classify.)
posted
1d ago
location
San Jose, United States of America
languages
python, sql
tools
aws, databricks, elasticsearch
> stack
python, sql, aws, databricks, elasticsearch, hadoop, hive, spark, airflow
> education
masters
> description
<h2 dir="ltr" style="line-height:1.3800000000000001;margin-top:12pt;margin-bottom:12pt;"><span style="font-size:13.999999999999998pt;font-family:Arial,sans-serif;color:#434343;background-color:transparent;font-weight:700;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;"><u>Job Description</u></span><span style="font-size:13.999999999999998pt;font-family:Arial,sans-serif;color:#434343;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;"><u>:</u></span></h2><p dir="ltr" style="line-height:1.7999999999999998;background-color:#ffffff;margin-top:12pt;margin-bottom:0pt;padding:0pt 0pt 12pt 0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">We are looking for a candidate with 5+ years of experience in a Data Engineer role to join our growing team of analytics experts. The hire will be responsible for expanding and optimizing our data and data pipeline architecture, as well as optimizing data flow and collection for cross-functional teams. The Data Engineer will support the company on data initiatives and will ensure that the data delivery architecture stays consistent across ongoing projects. Proficiency in both English and Mandarin is required to facilitate effective communication with our diverse team and international stakeholders. 
</span></p><h3 dir="ltr" style="line-height:1.6615384615384612;background-color:#ffffff;margin-top:0pt;margin-bottom:12pt;"><span style="font-size:13.999999999999998pt;font-family:Arial,sans-serif;color:#434343;background-color:transparent;font-weight:700;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;"><strong><u>Responsibilities:</u></strong></span></h3><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:9pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Create and maintain optimal data pipeline architecture.</span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Assemble large, complex data sets that meet functional / non-functional business requirements.</span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.</span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span 
style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using AWS ‘big data’ and SQL technologies.</span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Build analytics tools that utilize the data pipeline to provide actionable insights into customer behavior, operational efficiency and other key business performance metrics.</span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs.</span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Keep our data separated and secure across national boundaries through multiple data centers and AWS regions.</span></p><p dir="ltr" 
style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Create data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader.</span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:9pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Work with data and analytics experts to strive for greater functionality in our data systems.</span></p><h3 dir="ltr" style="line-height:1.6615384615384612;background-color:#ffffff;margin-top:12pt;margin-bottom:12pt;"><span style="font-size:13.999999999999998pt;font-family:Arial,sans-serif;color:#434343;background-color:transparent;font-weight:700;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Required Skills:</span></h3><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Minimum of 5+ years of hands-on experience with big data related ecosystems and tools such as Databricks, Snowflake</span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span 
style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Minimum of 5+ years of hands-on experience with big data related tools such as Spark, Hive, Hadoop, EMR.</span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;padding:9pt 0pt 0pt 0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Minimum of 5+ </span><span style="font-size:11pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">years of hands-on experience with</span><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;"> SQL and NoSQL databases.</span></p><p dir="ltr" style="line-height:1.3800000000000001;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Minimum of 3+</span><span style="font-size:11pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;"> years of hands-on experience with AWS services such as </span><span 
style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">EC2, ECS, MSK, RDS, Redshift</span></p><p dir="ltr" style="line-height:1.3800000000000001;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Minimum of 3+</span><span style="font-size:11pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;"> years of Python development experience</span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Hands-on experience building and optimizing ‘big data’ data pipelines, architectures and data sets.</span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Hands-on experience building and optimizing streaming/batch data pipelines.</span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span 
style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Experience of working on large scale systems </span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Strong analytic skills related to working with structured/unstructured datasets.</span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Successful history of manipulating, processing and extracting value from large disconnected datasets.</span></p><p dir="ltr" style="line-height:1.7217391304347824;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Knowledgeable about data modeling, data access, and data storage techniques.</span></p><p dir="ltr" style="line-height:1.7217391304347824;margin-top:0pt;margin-bottom:18pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Degree in Computer Science, Statistics, 
Informatics, Information Systems, or another quantitative field; a Master’s is a plus</span></p><p dir="ltr" style="line-height:1.7217391304347824;margin-top:0pt;margin-bottom:18pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Fluent spoken Mandarin is a must</span></p><h3 dir="ltr" style="line-height:1.3800000000000001;margin-top:12pt;margin-bottom:12pt;"><span style="font-size:13.999999999999998pt;font-family:Arial,sans-serif;color:#434343;background-color:transparent;font-weight:700;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Preferred Qualifications</span><span style="font-size:13.999999999999998pt;font-family:Arial,sans-serif;color:#434343;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">:</span></h3><p dir="ltr" style="line-height:1.3800000000000001;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Familiar with indexing solutions such as Elasticsearch, Solr</span></p><p dir="ltr" style="line-height:1.3800000000000001;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Familiar with streaming systems and tools such as Spark Streaming, Kafka Streams, Flink, 
etc.</span></p><p dir="ltr" style="line-height:1.3800000000000001;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Familiar with workflow management tools such as Airflow, Apache NiFi, etc.</span></p><p dir="ltr" style="line-height:1.3800000000000001;background-color:#ffffff;margin-top:0pt;margin-bottom:0pt;"><span style="font-size:11.5pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Good understanding of machine learning; strong numerical and analytical skills</span></p>