Mohamed HANNANI, Ph.D.
Summary
Aspiring Data Scientist with a robust background in data analysis, statistical modeling, and machine learning. Proficient in Python, R, SQL, and data manipulation libraries. Adept at developing predictive models with accuracy rates exceeding 90%, facilitating data-driven decision-making for optimized business outcomes. Eager to leverage data to drive innovation while pursuing a Ph.D. in a research-focused program.
Education
Master of Data Science
University of Cadi Ayad • Marrakech, Morocco • 2020 – 2022
Bachelor of Computer Science
University of Cadi Ayad • Marrakech, Morocco • 2017 – 2020
Experience
Data Engineer & Scientist
Indatacore
Feb 2023 - Present, Casablanca, Morocco
- Led a project team in refining the data extraction process for printed and scanned bills, reducing processing time by 30% and improving the system's accuracy by 15%.
- Teamed up with software engineers to deploy machine learning models into production as part of the SKyID product, using Docker and Kubernetes to make the models scale seamlessly.
- Designed and implemented data cleaning and preprocessing pipelines for internal signature data, culminating in a 95% boost in data integrity and accuracy.
- #Optimization
- #AWS
- #MLOps
- #PyTorch
- #Generative Models
- #Transformers
- #LLMs
- #FastAPI
- #Data Pipeline
- #Apache Spark
Data Scientist
Indatacore
Aug 2022 - Feb 2023, Casablanca, Morocco
- Developed and implemented a state-of-the-art OCR solution to extract information from bank checks, enabling check validity to be verified with 95% accuracy and eliminating the need for manual verification.
- Applied algorithms to automatically extract essential information from bill documents, achieving 98% accuracy, reducing manual effort, and increasing data processing efficiency by 70%.
- Pioneered the enhancement of data science practices and workflows within the organization, leading to a 20% increase in overall project efficiency, closer collaboration, and faster problem-solving.
- #Pruning
- #MLOps
- #PyTorch
- #ETL
- #Transformers
- #GCP
- #Git
- #FastAPI
- #Data Pipeline
Intern Data Scientist
Indatacore
Mar 2022 - Aug 2022, Casablanca, Morocco
- Achieved 97% accuracy in an anti-spoofing system, significantly reducing the risk of unauthorized access to sensitive systems and leading to a 50% drop in security breaches related to spoofed identities, fortifying overall data protection.
- Integrated the anti-spoofing model into a React.js application using TensorFlow.js, resulting in a 30% reduction in response time for liveness detection, improving user experience and ensuring robust protection against fraud attempts.
- #Web Scraping
- #TensorFlow.js
- #Machine Learning
- #PyTorch
- #ETL
- #Transformers
- #GCP
- #APIs
- #React.js
- #Apache Spark
Certificates
Natural Language Processing Specialization
Coursera • Jun 2021 - Aug 2021
Machine Learning
Coursera • Apr 2021 - Jun 2021
Apply Generative Adversarial Networks
Coursera • May 2021 - Jul 2021
Skills
- Effective communication skills, collaborating with cross-functional teams to advance project objectives and drive innovation.
- Proficient in Python, Scala, R for data manipulation, analysis, and modeling, and SQL for data pipeline development.
- Familiarity with big data frameworks such as Apache Spark and Apache Kafka for handling large-scale data processing.
- Proficiency in data engineering tools such as Ansible and Terraform, which allow me to automate infrastructure provisioning, configuration, and deployment processes.
- Experienced in containerization technologies like Docker and container orchestration platforms like Kubernetes, which enable me to deploy, manage, and scale data applications efficiently in various environments.
- Knowledgeable in various database systems, including relational and NoSQL databases.
- Effectively convey complex technical ideas to diverse audiences.
- Proficient in containerization with Docker and container orchestration with Kubernetes.
- Able to articulate ideas clearly, fostering a cohesive and productive work environment.
- Proficient in SQL and database management for data retrieval and manipulation.
- Experienced in conducting experiments and A/B testing.
Programming:
- #Python
- #R
- #SQL
- #Bash
- #JavaScript
Deep Learning Frameworks:
- #Keras
- #PyTorch
Frameworks:
- #Django
- #Flask
- #Dash
- #Plotly
- #Shiny
- #FastAPI
- #Streamlit
Libraries:
- #Pandas
- #Scikit-learn
- #NumPy
- #Matplotlib
- #SciPy
- #Seaborn
Data Analytics Tools:
- #Power BI
- #Pentaho
- #Talend
- #IBM Cognos Analytics
Developer Tools:
- #Git
- #Docker
- #GitLab
- #Jenkins
- #GitHub
Cloud:
- #AWS
- #GCP
- #Azure
Infrastructure Management:
- #Terraform
- #Ansible
Data Integration:
- #Informatica
Cloud Data Warehousing:
- #Snowflake
Development Environment:
- #Anaconda
- #JupyterLab
Big Data Technologies:
- #Apache Spark
- #Hadoop
- #Apache Kafka
- #Apache Flink
- #Databricks
Math:
- #Statistics
- #Probability
- #Linear Algebra
Misc:
- #Linux
- #LaTeX
- #Bash
Databases:
- #PostgreSQL
- #MongoDB
- #MySQL
Project Management Tools:
- #JIRA