Hussein Awala

Châtillon, Île-de-France, France
2K followers, 500+ connections

About

As a Senior Data Engineer at Voodoo, I am a skilled expert in the realm of big data…

Experience

  • Voodoo

    Paris, Île-de-France, France

  • -

    Paris, Île-de-France, France

  • -

    Paris, Île-de-France, France

  • -

    Paris, Île-de-France, France

  • -

    Paris, Île-de-France, France

  • -

    Paris, Île-de-France, France

  • -

    Paris Area, France

  • -

    Paris Area, France

  • -

    Paris 15, Île-de-France, France

Education

  • Grenoble INP - Ensimag

    National School of Computer Science and Applied Mathematics of Grenoble

    -

    MOSIG DS program with courses:
    Data Challenges
    Data management in large-scale distributed systems
    Advanced learning models
    Convex and Distributed Optimization
    High performance computing for mathematical models
    Fundamentals of probabilistic data mining
    Machine Learning fundamentals
    Advanced algorithms for machine learning and data mining
    Information visualization
    Distributed Systems

Courses

  • Advanced Database Aspects

    INFO450

  • Advanced Operating Systems

    INFO403

  • Advanced Web Technologies

    INFO446

  • Artificial Intelligence and Knowledge Representation

    INFO444

  • Cloud Computing

    INFO448

  • Event-Driven Programming and Graphical User Interface

    INFO445

  • Image, Video and Audio

    INFO449

  • Machine Learning

    INFO447

  • Mobile Applications Development

    INFO438

  • Networks Interconnection and Security

    INFO402

  • Programming of Distributed Applications

    INFO408

Projects

  • Spark On K8S

    A Python package to submit and manage Apache Spark applications on Kubernetes.

  • Apache Airflow

    Apache Airflow is a platform to programmatically author, schedule, and monitor workflows.
    I'm a committer, a member of the project management committee (PMC), and a member of the security team in the project.

  • Photovoltaic panel detector

    -

    A project that uses deep learning models to detect photovoltaic panels on building rooftops by processing satellite images from the Google Maps service. The project is of interest to insurance companies, since the presence of these panels affects the cost of insurance.

  • Data4Risk workflow

    -

    A new architecture for the meteorological file processing system, deployed in a Kubernetes cluster and managed by Argo Workflows, designed to process all files at every stage of the pipeline in parallel, in a scalable, fault-tolerant environment.

  • Evaluation of PySpark performance

    -

    In this project, we test all PySpark functions with different numbers of workers and threads, different storage modes (memory, hard disk, or both), and with and without caching our RDDs, to find the best ways to process data on a Spark cluster.
    We use a large dataset (over 40 GB) that represents 29 days of activity in a large-scale Google cluster of about 12.5k machines. It includes information about the jobs executed on this cluster during that period as well as information about the corresponding resource usage.

  • Atmo Challenge

    -

    Design and implementation of several machine learning models (RNN, Random Forest, AdaBoost, ...) to predict pollutants in the Grenoble area, in collaboration with ATMO Auvergne-Rhône-Alpes.
    In this project, we have data from 107 stations distributed across different French regions: rows containing the values measured at these stations every hour over 5 years (2012 to 2016), plus data from another prediction model that we can use to reinforce our own.

  • Parallel version of agglomerative clustering algorithm

    -

    Design and implementation of a parallel version of the agglomerative hierarchical clustering algorithm, using different HPC libraries (OpenMP, MPI), to speed up the algorithm on high-performance computers.

  • Sensors Network

    -

    • Creating a model of sensors and routers, and developing the system using Java's distributed programming packages.
    • Developing a second simulation system that auto-generates data and creates problems in the first system to test it.

  • Hadoop Cluster

    -

    Implementing a Hadoop cluster and writing MapReduce code to extract results from a huge log file of an international enterprise.

  • News Application

    -

    Developing a customizable Android application that serves as the core of a news application, consisting of customizable categories, views, notifications, and users, linked to a cloud NoSQL database.

  • Java IDE

    -

    Design and implementation of a program (IDE) that lets users create the GUI of their program by drag and drop (Java Swing).

  • DBpedia: Movies Explore

    -

    • Developing a website to explore DBpedia data using SPARQL, PHP, and JavaScript.
    • This website aims to illustrate the concept of ontology structure and how search algorithms find data in a huge graph-structured dataset.

  • Institute of Doctorate Online System

    -

    Designing and developing a system to organize work at the doctoral institute of the Lebanese University: instrument booking, meeting appointments, profile storage, staff attendance monitoring, and material stock checks.

  • Calculation Server

    -

    Advanced operating systems project containing a server that organizes work between clients and the other computing server.

  • Insurance Company’s system

    -

    Design and implementation of an insurance company’s system using Java and MySQL.

  • Lebanese E-Election

    -

    Designing and developing a project to move Lebanese elections to e-elections, using C# and MSSQL, covering every step from candidacy to result calculation, with multiple distributed programs connected over a secure network.

  • Twitter: Data Indexing and Search

    -

    • Developing a project to organize Twitter data (accounts, circles, hashtags, ...) in suitable data structures.
    • The goal is to speed up searches in this data and make the links between accounts more noticeable.

Honors and awards

  • 4th place LUCPC 2018

    Lebanese University

    4th place in the Lebanese University Collegiate Programming Contest 2018

  • Relentless Programmer certificate

    ICPC & ACPC directors

    9th place in the Lebanese Collegiate Programming Contest 2017, qualifying for the Arab Collegiate Programming Contest 2017

  • 2nd place LUCPC 2017

    Lebanese University

    2nd place in the Lebanese University Collegiate Programming Contest 2017

Languages

  • Arabic

    Native or bilingual proficiency

  • English

    Professional working proficiency

  • French

    Full professional proficiency
