Shahrukh Khan

Shahrukh Khan

Crailsheim, Baden-Württemberg, Deutschland
11.947 Follower:innen 500+ Kontakte

Info

I am a Senior AI Engineer(Speech and Language Processing) at Yoummday GmbH. My research…

Aktivitäten

Anmelden, um alle Aktivitäten zu sehen

Berufserfahrung

  • yoummday Grafik

    yoummday

    Munich, Bavaria, Germany

  • -

    Munich, Bavaria, Germany

  • -

    Berlin, Germany

  • -

    Saarbrücken, Saarland, Germany

  • -

    Mannheim, Baden-Württemberg, Germany

  • -

    Islamabad, Pakistan

  • -

    Islamabad, Pakistan

  • -

    Islamabad

  • -

    Islamabad

Ausbildung

  • Universität des Saarlandes Grafik

    Universität des Saarlandes

    Fully funded graduate school merit-based scholarship by the International Max Planck Research School for Computer Science (IMPRS-CS) which is a graduate program jointly run by the Max Planck Institute for Informatics (MPI-INF), the Max Planck Institute for Software Systems (MPI-SWS), and the Computer Science Department at Saarland University. The IMPRS for Computer Science (IMPRS-CS) is highly research-oriented.

    Coursework:
    + Machine Learning Privacy
    + Neural Networks Theory and…

    Fully funded graduate school merit-based scholarship by the International Max Planck Research School for Computer Science (IMPRS-CS) which is a graduate program jointly run by the Max Planck Institute for Informatics (MPI-INF), the Max Planck Institute for Software Systems (MPI-SWS), and the Computer Science Department at Saarland University. The IMPRS for Computer Science (IMPRS-CS) is highly research-oriented.

    Coursework:
    + Machine Learning Privacy
    + Neural Networks Theory and Implementation
    + Multimodal Dialogue Systems
    + Machine Learning CyberSecurity
    + Crowdsourcing Natural Language Annotations
    + Software Engineering
    + Data Networks

  • Activities and Societies: Cricket

Bescheinigungen und Zertifikate

Veröffentlichungen

  • Joint Learn: A python package for task-specific weight sharing for sequence classification

    Software Impacts Journal, Elsevier

    Transfer Learning has achieved state-of-the-art results recently in Machine Learning and especially, Natural Language Processing tasks. However, for low resource corpora where there is a lack of pre-trained model checkpoints available. We propose Joint Learn which leverages task-specific weight-sharing for training multiple sequence classification tasks simultaneously and has empirically shown results in more generalizable models. Joint Learn is a PyTorch-based comprehensive toolkit for…

    Transfer Learning has achieved state-of-the-art results recently in Machine Learning and especially, Natural Language Processing tasks. However, for low resource corpora where there is a lack of pre-trained model checkpoints available. We propose Joint Learn which leverages task-specific weight-sharing for training multiple sequence classification tasks simultaneously and has empirically shown results in more generalizable models. Joint Learn is a PyTorch-based comprehensive toolkit for weight-sharing in text classification settings.

    Andere Autor:innen
    Veröffentlichung anzeigen
  • BERT Probe: A python package for probing attention based robustness evaluation of BERT models

    Software Impacts Journal, Elsevier

    Transformer models based on attention-based architectures have been significantly successful in establishing state-of-the-art results in natural language processing (NLP). However, recent work about the adversarial robustness of attention-based models shows that their robustness is susceptible to adversarial inputs causing spurious outputs thereby raising questions about the trustworthiness of such models. In this paper, we present BERT Probe which is a python-based package for evaluating…

    Transformer models based on attention-based architectures have been significantly successful in establishing state-of-the-art results in natural language processing (NLP). However, recent work about the adversarial robustness of attention-based models shows that their robustness is susceptible to adversarial inputs causing spurious outputs thereby raising questions about the trustworthiness of such models. In this paper, we present BERT Probe which is a python-based package for evaluating robustness to attention attribution based on character-level and word-level evasion attacks and empirically quantifying potential vulnerabilities for sequence classification tasks. Additionally, BERT Probe also provides two out-of-the-box defenses against character-level attention attribution-based evasion attacks.

    Andere Autor:innen
    Veröffentlichung anzeigen
  • Hindi/Bengali Sentiment Analysis Using Transfer Learning and Joint Dual Input Learning with Self Attention

    BOHR International Journal of Computer Science (BIJCS)

    Sentiment Analysis typically refers to using natural language processing, text analysis, and computational linguistics to extract effect and emotion-based information from text data. Our work explores how we can effectively use deep neural networks in transfer learning and joint dual input learning settings to effectively classify sentiments and detect hate speech in Hindi and Bengali data. We start by training Word2Vec word embeddings for the Hindi HASOC dataset and Bengali hate speech and…

    Sentiment Analysis typically refers to using natural language processing, text analysis, and computational linguistics to extract effect and emotion-based information from text data. Our work explores how we can effectively use deep neural networks in transfer learning and joint dual input learning settings to effectively classify sentiments and detect hate speech in Hindi and Bengali data. We start by training Word2Vec word embeddings for the Hindi HASOC dataset and Bengali hate speech and then train LSTM and subsequently, employ parameter sharing-based transfer learning to Bengali sentiment classifiers by reusing and fine-tuning the trained weights of Hindi classifiers with both classifiers being used as the baseline in our study. Finally, we use BiLSTM with self-attention in a joint dual input learning setting where we train a single neural network on the Hindi and Bengali datasets simultaneously using their respective embeddings.

    Andere Autor:innen
    Veröffentlichung anzeigen
  • White-Box Attacks on Hate-speech BERT Classifiers in German with Explicit and Implicit Character Level Defense

    BOHR International Journal of Computer Science (BIJCS)

    Attention-based Transformer models have achieved state-of-the-art results in natural language processing(NLP). However, recent work shows that the underlying attention mechanism can be exploited by adversaries to craft malicious inputs designed to induce spurious outputs, thereby harming model performance and trustworthiness. Unlike in the vision domain, the literature examining neural networks under adversarial conditions in the NLP domain is limited and most of it focuses mainly on the…

    Attention-based Transformer models have achieved state-of-the-art results in natural language processing(NLP). However, recent work shows that the underlying attention mechanism can be exploited by adversaries to craft malicious inputs designed to induce spurious outputs, thereby harming model performance and trustworthiness. Unlike in the vision domain, the literature examining neural networks under adversarial conditions in the NLP domain is limited and most of it focuses mainly on the English language. In this paper, we first analyze the adversarial robustness of Bidirectional Encoder Representations from Transformers (BERT) models for German datasets. Second, we introduce two novel NLP attacks. Namely, a character-level and word-level attacks, both of which utilize attention scores to calculate where to inject character-level and word-level noise, respectively. Finally, we present two defense strategies against the attacks above. The first implicit character-level defense is a variant of adversarial training, which trains a new classifier capable of abstaining/rejecting certain (ideally adversarial) inputs. The other explicit character-level defense learns a latent representation of the complete training data vocabulary and then maps all tokens of an input example to the same latent space, enabling the replacement of all out of vocabulary tokens with the most similar in-vocabulary tokens based on the cosine similarity metric.

    Andere Autor:innen
    Veröffentlichung anzeigen

Patente

  • System and Methods for Real Time Identification of Change-Induced Incidents in AIOPs Applications

    US 96279747 / P202007865PK01

    An embodiment includes determining if a new incident report of a new incident matches any resolved incident reports associated with resolved incidents. The embodiment performs a first classification operation on the new incident report to determine if the new incident report is likely to be similar to any resolved incident reports associated with resolved incidents. The embodiment also performs a second classification operation on the new incident report to generate a ranked list of changes…

    An embodiment includes determining if a new incident report of a new incident matches any resolved incident reports associated with resolved incidents. The embodiment performs a first classification operation on the new incident report to determine if the new incident report is likely to be similar to any resolved incident reports associated with resolved incidents. The embodiment also performs a second classification operation on the new incident report to generate a ranked list of changes that are likely to be similar to the new incident report. The embodiment outputs the ranked list of changes to an incident manager for evaluation, then receives an input representative of a selected change from among the ranked list of changes responsible for causing the new incident. The embodiment revises the new incident report to include a reference to the selected change.

    Andere Erfinder:innen
    Patent anzeigen

Projekte

  • A Novel Approach for text-to-SQL: Dual Transformers Approach

    Kuan Xu et. al published their work “SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising” with the main hypothesis that arguing that current transformer based Sequence to Sequence architectures are powerful enough to overcome the shortcomings as stated “SQL queries with different clause order may have exact same semantic meaning and return same results by execution. The token interchangeability may confusion model based on S2S generation. Second, the grammar constraint induced…

    Kuan Xu et. al published their work “SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising” with the main hypothesis that arguing that current transformer based Sequence to Sequence architectures are powerful enough to overcome the shortcomings as stated “SQL queries with different clause order may have exact same semantic meaning and return same results by execution. The token interchangeability may confusion model based on S2S generation. Second, the grammar constraint induced by structural logical form is ignored during auto-aggressive decoding, therefore the model may predict SQL with invalid logical form. Third, schema linking, which has been suggested to be the crux of the text-to-SQL task, is not specially addressed by vanilla S2S model.” for solving text-to-SQL task using a seq2seq architecture.
    I propose an additional transformer block on top of this S2S approach which can prove to be very useful where there is very little lexical overlap between the input sequence and the schema or the data present in the table. Moreover, this would enable vector-based similarity over generated SQL’s entities for performing the entity linking.

    Projekt anzeigen
  • BERT for Chemical Industrial Domain

    Pre-training a language model for the Chemical industry, while working at Recobo.ai (Recobo: Regulatory Compliance Bot, A neural search and question-answering solution for the Pharmaceutical and Chemical industry), I encountered a plethora of domain-specific Chemical industry data, as it is a heavily regularized industry. While fine-tuning downstream tasks from checkpoints of the general BERT model gave us above-average results, however, the consensus within our team was that there was a lot of…

    Pre-training a language model for the Chemical industry, while working at Recobo.ai (Recobo: Regulatory Compliance Bot, A neural search and question-answering solution for the Pharmaceutical and Chemical industry), I encountered a plethora of domain-specific Chemical industry data, as it is a heavily regularized industry. While fine-tuning downstream tasks from checkpoints of the general BERT model gave us above-average results, however, the consensus within our team was that there was a lot of room for improvement.

    Projekt anzeigen
  • Multilingual PDF to Text

    A python library for extracting text from PDFs without losing the formatting of the PDF content.

    Projekt anzeigen
  • Multi-Task Learning Transformers

    A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets.

    Projekt anzeigen
  • Hindi/Bengali Sentiment Analysis Using Transfer Learning and Joint Dual Input Learning with Self Attention

    Sentiment Analysis typically refers to using natural language processing, text analysis and computational linguistics to extract affect and emotion based information from text data. Our work explores how we can effectively use deep neural networks in transfer learning and joint dual input learning settings to effectively classify sentiments and detect hate speech in Hindi and Bengali data. We start by training Word2Vec word embeddings for Hindi \textbf{HASOC dataset} and Bengali hate speech and…

    Sentiment Analysis typically refers to using natural language processing, text analysis and computational linguistics to extract affect and emotion based information from text data. Our work explores how we can effectively use deep neural networks in transfer learning and joint dual input learning settings to effectively classify sentiments and detect hate speech in Hindi and Bengali data. We start by training Word2Vec word embeddings for Hindi \textbf{HASOC dataset} and Bengali hate speech and then train LSTM and subsequently, employ parameter sharing based transfer learning to Bengali sentiment classifiers by reusing and fine-tuning the trained weights of Hindi classifiers with both classifier being used as baseline in our study. Finally, we use BiLSTM with self attention in joint dual input learning setting where we train a single neural network on Hindi and Bengali dataset simultaneously using their respective embeddings.

    Projekt anzeigen
  • LinkedIn Jobs Scraper

    Scrape public jobs postings from LinkedIn in native python without selenium or any headless browser.

    Projekt anzeigen
  • Top 20 movies on popularity and Ratings

    In this project I have used the Kaggle's TMDB dataset where I have done analysis to compute that which are the top movies, in order to compute the top movies I have used a hybrid metric which used IMDB like weighted rating and movie's popularity score, so in order to achieve this, I first compute IMDB like score using movie rating and rating counts then use these to compute final score to get top movies.

    Projekt anzeigen
  • Classifying Dog Breeds Using CNNs in PyTorch

    This project is inspired by the Kaggle Dog Breed competition. The goal of this project is to build a pipeline that could be used in a web or mobile app to process real-world images. The pipeline must be able to:

    Detect whether an image contains a dog or human, and distinguish between the two.
    If given an image of a dog, the model predicts the canine’s breed with at least 60% accuracy. Random chance is only 0.75% since there are 133 dog breeds (classes) in the dataset.
    If given an…

    This project is inspired by the Kaggle Dog Breed competition. The goal of this project is to build a pipeline that could be used in a web or mobile app to process real-world images. The pipeline must be able to:

    Detect whether an image contains a dog or human, and distinguish between the two.
    If given an image of a dog, the model predicts the canine’s breed with at least 60% accuracy. Random chance is only 0.75% since there are 133 dog breeds (classes) in the dataset.
    If given an image of a human, the model identifies the dog breeds the person most resembles.

    Projekt anzeigen
  • Seinfeld TV scripts generation using RNNs in PyTorch

    Seinfeld TV scripts generation using RNNs in PyTorch
    In this project, I generated my own Seinfeld TV scripts using RNNs. I used part of the Seinfeld dataset of scripts from 9 seasons. The Neural Network I built generated a new fake TV script, based on patterns it recognizes in this training data.

    Projekt anzeigen
  • Predicting Bike Sharing patterns


    Predicting Bike Sharing Patterns using Neural Network
    In this project I have used Neural Network to predict number of bikes needed by the Bike sharing company in their stock based on factors like location, weather etc. Bike sharing systems are new generation of traditional bike rentals where whole process from membership, rental and return back has become automatic. Through these systems, user is able to easily rent a bike from a particular position and return back at another position…


    Predicting Bike Sharing Patterns using Neural Network
    In this project I have used Neural Network to predict number of bikes needed by the Bike sharing company in their stock based on factors like location, weather etc. Bike sharing systems are new generation of traditional bike rentals where whole process from membership, rental and return back has become automatic. Through these systems, user is able to easily rent a bike from a particular position and return back at another position. Currently, there are about over 500 bike-sharing programs around the world which is composed of over 500 thousands bicycles. Today, there exists great interest in these systems due to their important role in traffic, environmental and health issues. The data comes from the UCI Machine Learning Database.

    Projekt anzeigen
  • Microfinance Loan Credit Scoring

    The dairy farmers provide milk to the food company. The food company in return makes weekly payments to the farmers which are routed through the bank. The bank is offering loans to the dairy farmers against these payments. Duration of loan is 13 weeks. After a farmer has acquired loan, it'll be repaid in weekly installments. These installments are deducted from payments of the food company to the farmer over the course of next 13 weeks. Challenge: Given historical 1 year (52 weeks) of payment…

    The dairy farmers provide milk to the food company. The food company in return makes weekly payments to the farmers which are routed through the bank. The bank is offering loans to the dairy farmers against these payments. Duration of loan is 13 weeks. After a farmer has acquired loan, it'll be repaid in weekly installments. These installments are deducted from payments of the food company to the farmer over the course of next 13 weeks. Challenge: Given historical 1 year (52 weeks) of payment data, you role as analyst is to identify Which farmers are most suitable to target for loan product X? In other words, which farmers are likely to have consistent payments in next 13 weeks? How much loan should be offered to each farmer?

    Projekt anzeigen
  • E Crime File Management System

    This project was made in JAVA EE for the web part and phone gap & web services in JAVA EE for the android part. Its main features were:
    Online crime reporting
    Crime statistics
    Crime investigation details
    Offline chat with police
    Record management of Wanted Criminals
    Record Management of Missing Persons

    Andere Mitarbeiter:innen

Auszeichnungen/Preise

  • IMPRS-CS Scholarship

    Max-Planck-Institut für Informatik

    Fully funded graduate school merit-based scholarship of graduate school, by the International Max Planck Research School for Computer Science (IMPRS-CS) which is a graduate program jointly run by the Max Planck Institute for Informatics (MPI-INF), the Max Planck Institute for Software Systems (MPI-SWS), and the Computer Science Department at Saarland University. The IMPRS for Computer Science (IMPRS-CS) is highly research-oriented.

  • Facebook Pytorch Udacity Scholarship

    Udacity

  • Double Gold Medalist

    Comsats Institute of Technology, Islamabad

    I secured the top position in Comsats Institute of Technology Islamabad campus, as well as across all other campuses hence I got Institute gold medal as well as campus gold medal.

Prüfungsergebnisse

  • IELTS(Academic)

    Prüfungsergebnis: 8.0/9.0

Sprachen

  • English

    -

  • German

    -

  • Urdu

    -

Erhaltene Empfehlungen

Weitere Aktivitäten von Shahrukh Khan

Shahrukh Khans vollständiges Profil ansehen

  • Herausfinden, welche gemeinsamen Kontakte Sie haben
  • Sich vorstellen lassen
  • Shahrukh Khan direkt kontaktieren
Mitglied werden. um das vollständige Profil zu sehen

Weitere ähnliche Profile

Weitere Mitglieder namens Shahrukh Khan in Deutschland

Entwickeln Sie mit diesen Kursen neue Kenntnisse und Fähigkeiten