Mohamed HANNANI, Ph.D.

Summary

Aspiring Data Scientist with a robust background in data analysis, statistical modeling, and machine learning. Proficient in Python, R, SQL, and data manipulation libraries. Adept at developing predictive models with accuracy rates exceeding 90%, facilitating data-driven decision-making for optimized business outcomes. Eager to leverage data to drive innovation in pursuit of a Ph.D. in a research-focused program.

Education

Master of data science

University of Cadi Ayad • Marrakech, Morocco

2020 – 2022

Bachelor of Computer Science

University of Cadi Ayad • Marrakech, Morocco

2017 – 2020

Experience

Data Engineer & Scientist

Indatacore
Feb 2023 - Present, Casablanca, Morocco
  • Led a project team in refining data extraction process for printed or scanned bills, resulting in a 30% reduction in processing time and improving system's accuracy by 15%.
  • Teamed up with software engineers to deploy machine learning models into production as part of the SKyID product, leveraging cutting-edge technologies such as Docker and Kubernetes, resulting in a seamless model scalability.
  • Orchestrated seamless design and implementation of data cleaning and preprocessing pipelines for internal signature data culminating in an exceptional 95% boost in data integrity and accuracy.
  • #Optimization
  • #AWS
  • #MLOps
  • #PyTorch
  • #Generative Models
  • #Transformers
  • #LLMs
  • #FastAPI
  • #Data Pipeline
  • #Apache Spark

Data Scientist

Indatacore
Aug 2022 - Feb 2023, Casablanca, Morocco
  • Implemented and developed a state-of-the-art OCR solutions designed to extract information from bank checks, enableding to verify validity of checks with an impressive accuracy rate of 95% eliminating the need for manual verification.
  • Applied algorithms to automatically extract essential information from bills document, outstanding an accuracy rate of 98% resulting in reducing manual effort and increasing data processing efficiency by 70%.
  • Pioneered enhancement of data science practices and workflows within organization, leading to 20% increase in overall project efficiency culminating in enriched collaboration and faster problem-solving.
  • #Pruning
  • #MLOps
  • #PyTorch
  • #ETL
  • #Transformers
  • #GCP
  • #Git
  • #FastAPI
  • #Data Pipeline

Intern Data Scientist

Indatacore
Mars 2022 - Aug 2022, Casablanca, Morocco
  • Accomplished an impressive 97% accuracy in anti-spoofing system, significantly reduced risk of unauthorized access to sensitive systems. leding to 50% dip in security breaches related to spoofed identities, fortifying overall data protection.
  • Integrated the anti-spoofing model into a ReactJs application using TensorFlow.js, resulting in 30% reduction in response time for liveness detection, increasing user experience and ensuring robust protection against fraud attempts.
  • #Web Scraping
  • #Tensorflow.js
  • #Machine Learning
  • #PyTorch
  • #ETL
  • #Transformers
  • #GCP
  • #APIs
  • #React.js
  • #Apache Spark

Certificates

Natural Language Processing Specialization

Coursera • Jun 2021 - Aug 2021

Machine Learning

Coursera • Apr 2021 - Jun 2021

Apply Generative Adversarial Networks

Coursera • May 2021 - Jul 2021

Skills

  • Effective communication skills, collaborating with cross-functional teams to foster project objectives and drive innovation.
  • Proficient in Python, Scala, R for data manipulation, analysis, and modeling, and SQL for data pipeline development.
  • Familiarity with big data frameworks such as Apache Spark, and Apache Kafka for handling large-scale data processing.
  • Proficiency in data engineering tools such as Ansible and Terraform, which allow me to automate infrastructure provisioning, configuration, and deployment processes.
  • Experienced in containerization technologies like Docker and container orchestration platforms like Kubernetes, which enable me to deploy, manage, and scale data applications efficiently in various environments.
  • Knowledgeable in various database systems, including relational and NoSQL databases.
  • Effectively convey complex technical ideas to diverse audiences.
  • Proficient in containerization with Docker and container orchestration with Kubernetes.
  • Ability to articulate ideas fosters a cohesive and productive work environment.
  • Proficient in SQL and database management for data retrieval and manipulation.
  • Experienced in conducting experiments and A/B testing.
___
    • Programming:
    • #Python
    • #R
    • #SQL
    • #Bash
    • #Javascript
      Deep Learning Frameworks:
    • #Keras
    • #PyTorch
      Frameworks:
    • #Django
    • #Flask
    • #Dash
    • #Plotly
    • #Shiny
    • #FastAPI
    • #Streamlit
      Libraries:
    • #Pandas
    • #Scikit-learn
    • #NumPy
    • #Matplotlib
    • #SciPy
    • #Seaborn
      Data Analytics Tools:
    • #Power BI
    • #Pentaho
    • #Talend
    • #IBM Cognos Analytics
      Developer Tools:
    • #Git
    • #Docker
    • #GitLab
    • #Jenkins
    • #GitHub
      Cloud:
    • #AWS
    • #GCP
    • #Azure
      Infrastructure Management:
    • #Terraform
    • #Ansible
      Data Integration:
    • #Informatica
      Cloud Data Warehousing:
    • #Snowflake
      Develpment Environment:
    • #Anaconda
    • #JupyterLab
      Big Data Technologies:
    • #Apache Spark
    • #Hadoop
    • #Apache Kafka
    • #Apache Flink
    • #DataBricks
      Math:
    • #Statistics
    • #Probability
    • #Linear algebra.
      Misc:
    • #Linux
    • #LaTeX
    • #Bash.
      Databases:
    • #PostgreSQL
    • #MongoDB
    • #MySQL
      Project Management Tools:
    • #JIRA.