
SARAH
Data Engineer and Analyst - AI Enthusiast
Female28 y/oData/AI/Machine Learning/Data ScientistLive in ShanghaiNationality Morocco
Share
Summary
Passionate Data Engineer and Analyst with a Master's in Software Engineering from Shanghai Jiao Tong University and solid experience in ETL pipelines, data warehousing, and real-time analytics. Skilled in Python, SQL, PostgreSQL, Dagster, Airflow, and data visualization. Previously contributed to aviation and product data projects, with a growing focus on AI and machine learning. Eager to apply data-driven solutions to real-world problems and contribute to innovative teams in onsite, hybrid, or remote environments.
Work experience
Data Engineer
Shanghai Feihao Property Management Co., Ltd.
2023.09-Current(2 years)
Led the overall design of databases to store, organize, and retrieve information with automated ETL
pipelines using Python, RDBMS, SQL, Apache Airflow and Kafka, besides demonstrating strong
communication skills in client presentations and negotiations, contributing to business growth and part-
nerships.
Consultant Data Engineer
Admiral
2022.11-2023.05(7 months)
November 2022 - Mai 2023
Built data pipelines that integrated information from diferent sources, including cleaning, transforming,
and loading data into a central data warehouse. Automated and scheduled these ETL processes for
consistent data aggregation. (Python, SQL, Hadoop, Tableau, Spark, Kafka)
Machine Learning Intern
Freebeat
2020.07-2021.01(7 months)
Participated in building, training, testing, and deploying a complex recurrent neural network for Music
Audio Analysis and Pattern Recognition using Python, with Tensorflow and Keras frameworks.
Artificial Intelligence Researcher - Academic
Shanghai Jiao Tong University - SEIEE
2019.09-2020.06(10 months)
Worked with a team of 1 Ph.D. and 2 Master’s students on a computer vision task to build a classifi-
cation model serving the medical field using Convolutional Neural Networks (CNNs), under the
supervision of Dr. Yao Jianguo. The model achieved high accuracy in diagnosing arrhythmias from ECG
time series.
Projects
ETL for aviation data
Data Engineer
2024.10-2025.03(6 months)
Developed an ETL pipeline for data from different sources like APIs, DBs, and flat files, involving data cleaning, normalization, and standardization processes, loaded the data into a centralized data warehouse, and automated and scheduled the overall pipeline. Python, Pandas, SQLAlchemy, PostgreSQL, SQL, Dagster, Github actions, CI/CD Pipeline Orchestration.
Full-stack web application
developer
2023.10-2024.01(4 months)
Developed a full-stack web application for the presentation of the company and user registration functionality
as a training task, handling the front end, back end, database management, and containerization.
React, Node.js, Express, Postgres, JavaScript, Docker.
Machine learning and deep learning on audio data
Assistant AI engineer
2020.08-2020.10(3 months)
Optimized a deep learning model for music beat and downbeat detection using an Artificial Neural Networks, combined with a multi-resolution spectrogram processor. Python, SciPy/NumPy. The developed deep learning model outperformed existing models by 12% in accuracy, was successfully implemented in production, and contributed to attracting more investors to the company’s product.
AI on images
Developer
2018.12-2019.02(3 months)
Developed a model to visualize image data and its semantic relationships, for tasks like object detection and image annotation. Python, Keras.
E-commerce mobile application
Developer
2017.05-2017.06(2 months)
Developed, with a team of 4, a mobile Android application for e-commerce, enabling user account registration,
login functionality, product input with detailed specifications, and exploration of product catalogs. Java, SQLite, RESTful API, Android Studio.
Educational experience
Shanghai Jiao Tong University
Software Engineering.
2019.09-2022.09(3 years)
Master of Software Engineering. Overall GPA: 3.5/4.00
Relevant courses: Algorithm Analysis and Theory, Web search and mining, Data visualization, Computer networks, Internet of things.
Scientific research direction: Deep Learning, Artificial Intelligence, Machine Learning.
Hassan II University of Casablanca
Mathematics & Computer Science.
2014.09-2017.06(3 years)
Bachelor of Science in Mathematics & Computer Science.
Relevant courses: Data Structures & Algorithms, Database Management Systems, Oriented Object Programming, Web Development, Networks programming, Systems programming, Functional programming, Mobile App Development.
Languages
English
Proficient
French
Native
Arabic
Native
Chinese (Mandarin)
Normal
Certificates
Introduction to Relational Databases
2023.08
Convolutional Neural Networks certificate
2021.05
Sequences, Time Series and Prediction
2020.05
Neural Networks and Deep Learning certificate
2020.02
Machine Learning
2020.01
IELTS
2019.03
Chinese language proficiency HSK3
2018.06
Skills
International Trading as a part time
Content Creation
Driving License in China
Files
Resume Search
Nationality
Job category
City or country
Sort by
Contact way
86****0130
sa**@**fr
*****

Membership will unlock the resume
Also view