„I worked with Shahrukh on developing a real time incident resolution system for AIOps. The zeal, knowledge and experience Shahrukh brought were key to success of this project. I was able to witness Shahrukh's interest to experiment and learn new stuff and go above and beyond the basic requirements. I wish Shahrukh success in his grad school. He will be a valuable resource to the organizations and projects he becomes a part of.“
Shahrukh Khan
Crailsheim, Baden-Württemberg, Deutschland
11.947 Follower:innen
500+ Kontakte
Info
I am a Senior AI Engineer(Speech and Language Processing) at Yoummday GmbH. My research…
Aktivitäten
-
After two incredible years at #Accenture and a decade of living in Germany, the time has come for me and my family to embrace a new chapter. This…
After two incredible years at #Accenture and a decade of living in Germany, the time has come for me and my family to embrace a new chapter. This…
Beliebt bei Shahrukh Khan
-
if you're serious about serving models in production then you should be using something that you can debug easily and scale to high throughput…
if you're serious about serving models in production then you should be using something that you can debug easily and scale to high throughput…
Beliebt bei Shahrukh Khan
-
We are going full circle: - we started Hugging Face with Thomas and Clem by following that same class (Stanford CS224N) taught by Richard Socher…
We are going full circle: - we started Hugging Face with Thomas and Clem by following that same class (Stanford CS224N) taught by Richard Socher…
Beliebt bei Shahrukh Khan
Berufserfahrung
Ausbildung
-
Universität des Saarlandes
–
Fully funded graduate school merit-based scholarship by the International Max Planck Research School for Computer Science (IMPRS-CS) which is a graduate program jointly run by the Max Planck Institute for Informatics (MPI-INF), the Max Planck Institute for Software Systems (MPI-SWS), and the Computer Science Department at Saarland University. The IMPRS for Computer Science (IMPRS-CS) is highly research-oriented.
Coursework:
+ Machine Learning Privacy
+ Neural Networks Theory and…Fully funded graduate school merit-based scholarship by the International Max Planck Research School for Computer Science (IMPRS-CS) which is a graduate program jointly run by the Max Planck Institute for Informatics (MPI-INF), the Max Planck Institute for Software Systems (MPI-SWS), and the Computer Science Department at Saarland University. The IMPRS for Computer Science (IMPRS-CS) is highly research-oriented.
Coursework:
+ Machine Learning Privacy
+ Neural Networks Theory and Implementation
+ Multimodal Dialogue Systems
+ Machine Learning CyberSecurity
+ Crowdsourcing Natural Language Annotations
+ Software Engineering
+ Data Networks -
–
Activities and Societies: Cricket
Bescheinigungen und Zertifikate
Veröffentlichungen
-
Joint Learn: A python package for task-specific weight sharing for sequence classification
Software Impacts Journal, Elsevier
Transfer Learning has achieved state-of-the-art results recently in Machine Learning and especially, Natural Language Processing tasks. However, for low resource corpora where there is a lack of pre-trained model checkpoints available. We propose Joint Learn which leverages task-specific weight-sharing for training multiple sequence classification tasks simultaneously and has empirically shown results in more generalizable models. Joint Learn is a PyTorch-based comprehensive toolkit for…
Transfer Learning has achieved state-of-the-art results recently in Machine Learning and especially, Natural Language Processing tasks. However, for low resource corpora where there is a lack of pre-trained model checkpoints available. We propose Joint Learn which leverages task-specific weight-sharing for training multiple sequence classification tasks simultaneously and has empirically shown results in more generalizable models. Joint Learn is a PyTorch-based comprehensive toolkit for weight-sharing in text classification settings.
Andere Autor:innenVeröffentlichung anzeigen -
BERT Probe: A python package for probing attention based robustness evaluation of BERT models
Software Impacts Journal, Elsevier
Transformer models based on attention-based architectures have been significantly successful in establishing state-of-the-art results in natural language processing (NLP). However, recent work about the adversarial robustness of attention-based models shows that their robustness is susceptible to adversarial inputs causing spurious outputs thereby raising questions about the trustworthiness of such models. In this paper, we present BERT Probe which is a python-based package for evaluating…
Transformer models based on attention-based architectures have been significantly successful in establishing state-of-the-art results in natural language processing (NLP). However, recent work about the adversarial robustness of attention-based models shows that their robustness is susceptible to adversarial inputs causing spurious outputs thereby raising questions about the trustworthiness of such models. In this paper, we present BERT Probe which is a python-based package for evaluating robustness to attention attribution based on character-level and word-level evasion attacks and empirically quantifying potential vulnerabilities for sequence classification tasks. Additionally, BERT Probe also provides two out-of-the-box defenses against character-level attention attribution-based evasion attacks.
Andere Autor:innenVeröffentlichung anzeigen -
Hindi/Bengali Sentiment Analysis Using Transfer Learning and Joint Dual Input Learning with Self Attention
BOHR International Journal of Computer Science (BIJCS)
Sentiment Analysis typically refers to using natural language processing, text analysis, and computational linguistics to extract effect and emotion-based information from text data. Our work explores how we can effectively use deep neural networks in transfer learning and joint dual input learning settings to effectively classify sentiments and detect hate speech in Hindi and Bengali data. We start by training Word2Vec word embeddings for the Hindi HASOC dataset and Bengali hate speech and…
Sentiment Analysis typically refers to using natural language processing, text analysis, and computational linguistics to extract effect and emotion-based information from text data. Our work explores how we can effectively use deep neural networks in transfer learning and joint dual input learning settings to effectively classify sentiments and detect hate speech in Hindi and Bengali data. We start by training Word2Vec word embeddings for the Hindi HASOC dataset and Bengali hate speech and then train LSTM and subsequently, employ parameter sharing-based transfer learning to Bengali sentiment classifiers by reusing and fine-tuning the trained weights of Hindi classifiers with both classifiers being used as the baseline in our study. Finally, we use BiLSTM with self-attention in a joint dual input learning setting where we train a single neural network on the Hindi and Bengali datasets simultaneously using their respective embeddings.
Andere Autor:innenVeröffentlichung anzeigen -
White-Box Attacks on Hate-speech BERT Classifiers in German with Explicit and Implicit Character Level Defense
BOHR International Journal of Computer Science (BIJCS)
Attention-based Transformer models have achieved state-of-the-art results in natural language processing(NLP). However, recent work shows that the underlying attention mechanism can be exploited by adversaries to craft malicious inputs designed to induce spurious outputs, thereby harming model performance and trustworthiness. Unlike in the vision domain, the literature examining neural networks under adversarial conditions in the NLP domain is limited and most of it focuses mainly on the…
Attention-based Transformer models have achieved state-of-the-art results in natural language processing(NLP). However, recent work shows that the underlying attention mechanism can be exploited by adversaries to craft malicious inputs designed to induce spurious outputs, thereby harming model performance and trustworthiness. Unlike in the vision domain, the literature examining neural networks under adversarial conditions in the NLP domain is limited and most of it focuses mainly on the English language. In this paper, we first analyze the adversarial robustness of Bidirectional Encoder Representations from Transformers (BERT) models for German datasets. Second, we introduce two novel NLP attacks. Namely, a character-level and word-level attacks, both of which utilize attention scores to calculate where to inject character-level and word-level noise, respectively. Finally, we present two defense strategies against the attacks above. The first implicit character-level defense is a variant of adversarial training, which trains a new classifier capable of abstaining/rejecting certain (ideally adversarial) inputs. The other explicit character-level defense learns a latent representation of the complete training data vocabulary and then maps all tokens of an input example to the same latent space, enabling the replacement of all out of vocabulary tokens with the most similar in-vocabulary tokens based on the cosine similarity metric.
Andere Autor:innenVeröffentlichung anzeigen
Patente
-
System and Methods for Real Time Identification of Change-Induced Incidents in AIOPs Applications
US 96279747 / P202007865PK01
An embodiment includes determining if a new incident report of a new incident matches any resolved incident reports associated with resolved incidents. The embodiment performs a first classification operation on the new incident report to determine if the new incident report is likely to be similar to any resolved incident reports associated with resolved incidents. The embodiment also performs a second classification operation on the new incident report to generate a ranked list of changes…
An embodiment includes determining if a new incident report of a new incident matches any resolved incident reports associated with resolved incidents. The embodiment performs a first classification operation on the new incident report to determine if the new incident report is likely to be similar to any resolved incident reports associated with resolved incidents. The embodiment also performs a second classification operation on the new incident report to generate a ranked list of changes that are likely to be similar to the new incident report. The embodiment outputs the ranked list of changes to an incident manager for evaluation, then receives an input representative of a selected change from among the ranked list of changes responsible for causing the new incident. The embodiment revises the new incident report to include a reference to the selected change.
Andere Erfinder:innenPatent anzeigen
Projekte
-
A Novel Approach for text-to-SQL: Dual Transformers Approach
–
Kuan Xu et. al published their work “SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising” with the main hypothesis that arguing that current transformer based Sequence to Sequence architectures are powerful enough to overcome the shortcomings as stated “SQL queries with different clause order may have exact same semantic meaning and return same results by execution. The token interchangeability may confusion model based on S2S generation. Second, the grammar constraint induced…
Kuan Xu et. al published their work “SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising” with the main hypothesis that arguing that current transformer based Sequence to Sequence architectures are powerful enough to overcome the shortcomings as stated “SQL queries with different clause order may have exact same semantic meaning and return same results by execution. The token interchangeability may confusion model based on S2S generation. Second, the grammar constraint induced by structural logical form is ignored during auto-aggressive decoding, therefore the model may predict SQL with invalid logical form. Third, schema linking, which has been suggested to be the crux of the text-to-SQL task, is not specially addressed by vanilla S2S model.” for solving text-to-SQL task using a seq2seq architecture.
I propose an additional transformer block on top of this S2S approach which can prove to be very useful where there is very little lexical overlap between the input sequence and the schema or the data present in the table. Moreover, this would enable vector-based similarity over generated SQL’s entities for performing the entity linking. -
BERT for Chemical Industrial Domain
–
Pre-training a language model for the Chemical industry, while working at Recobo.ai (Recobo: Regulatory Compliance Bot, A neural search and question-answering solution for the Pharmaceutical and Chemical industry), I encountered a plethora of domain-specific Chemical industry data, as it is a heavily regularized industry. While fine-tuning downstream tasks from checkpoints of the general BERT model gave us above-average results, however, the consensus within our team was that there was a lot of…
Pre-training a language model for the Chemical industry, while working at Recobo.ai (Recobo: Regulatory Compliance Bot, A neural search and question-answering solution for the Pharmaceutical and Chemical industry), I encountered a plethora of domain-specific Chemical industry data, as it is a heavily regularized industry. While fine-tuning downstream tasks from checkpoints of the general BERT model gave us above-average results, however, the consensus within our team was that there was a lot of room for improvement.
-
Multilingual PDF to Text
–
A python library for extracting text from PDFs without losing the formatting of the PDF content.
-
Multi-Task Learning Transformers
–
A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets.
-
Hindi/Bengali Sentiment Analysis Using Transfer Learning and Joint Dual Input Learning with Self Attention
–
Sentiment Analysis typically refers to using natural language processing, text analysis and computational linguistics to extract affect and emotion based information from text data. Our work explores how we can effectively use deep neural networks in transfer learning and joint dual input learning settings to effectively classify sentiments and detect hate speech in Hindi and Bengali data. We start by training Word2Vec word embeddings for Hindi \textbf{HASOC dataset} and Bengali hate speech and…
Sentiment Analysis typically refers to using natural language processing, text analysis and computational linguistics to extract affect and emotion based information from text data. Our work explores how we can effectively use deep neural networks in transfer learning and joint dual input learning settings to effectively classify sentiments and detect hate speech in Hindi and Bengali data. We start by training Word2Vec word embeddings for Hindi \textbf{HASOC dataset} and Bengali hate speech and then train LSTM and subsequently, employ parameter sharing based transfer learning to Bengali sentiment classifiers by reusing and fine-tuning the trained weights of Hindi classifiers with both classifier being used as baseline in our study. Finally, we use BiLSTM with self attention in joint dual input learning setting where we train a single neural network on Hindi and Bengali dataset simultaneously using their respective embeddings.
-
LinkedIn Jobs Scraper
–
Scrape public jobs postings from LinkedIn in native python without selenium or any headless browser.
-
Top 20 movies on popularity and Ratings
–
In this project I have used the Kaggle's TMDB dataset where I have done analysis to compute that which are the top movies, in order to compute the top movies I have used a hybrid metric which used IMDB like weighted rating and movie's popularity score, so in order to achieve this, I first compute IMDB like score using movie rating and rating counts then use these to compute final score to get top movies.
-
Classifying Dog Breeds Using CNNs in PyTorch
–
This project is inspired by the Kaggle Dog Breed competition. The goal of this project is to build a pipeline that could be used in a web or mobile app to process real-world images. The pipeline must be able to:
Detect whether an image contains a dog or human, and distinguish between the two.
If given an image of a dog, the model predicts the canine’s breed with at least 60% accuracy. Random chance is only 0.75% since there are 133 dog breeds (classes) in the dataset.
If given an…This project is inspired by the Kaggle Dog Breed competition. The goal of this project is to build a pipeline that could be used in a web or mobile app to process real-world images. The pipeline must be able to:
Detect whether an image contains a dog or human, and distinguish between the two.
If given an image of a dog, the model predicts the canine’s breed with at least 60% accuracy. Random chance is only 0.75% since there are 133 dog breeds (classes) in the dataset.
If given an image of a human, the model identifies the dog breeds the person most resembles. -
Seinfeld TV scripts generation using RNNs in PyTorch
–
Seinfeld TV scripts generation using RNNs in PyTorch
In this project, I generated my own Seinfeld TV scripts using RNNs. I used part of the Seinfeld dataset of scripts from 9 seasons. The Neural Network I built generated a new fake TV script, based on patterns it recognizes in this training data. -
Predicting Bike Sharing patterns
–
Predicting Bike Sharing Patterns using Neural Network
In this project I have used Neural Network to predict number of bikes needed by the Bike sharing company in their stock based on factors like location, weather etc. Bike sharing systems are new generation of traditional bike rentals where whole process from membership, rental and return back has become automatic. Through these systems, user is able to easily rent a bike from a particular position and return back at another position…
Predicting Bike Sharing Patterns using Neural Network
In this project I have used Neural Network to predict number of bikes needed by the Bike sharing company in their stock based on factors like location, weather etc. Bike sharing systems are new generation of traditional bike rentals where whole process from membership, rental and return back has become automatic. Through these systems, user is able to easily rent a bike from a particular position and return back at another position. Currently, there are about over 500 bike-sharing programs around the world which is composed of over 500 thousands bicycles. Today, there exists great interest in these systems due to their important role in traffic, environmental and health issues. The data comes from the UCI Machine Learning Database. -
Microfinance Loan Credit Scoring
–
The dairy farmers provide milk to the food company. The food company in return makes weekly payments to the farmers which are routed through the bank. The bank is offering loans to the dairy farmers against these payments. Duration of loan is 13 weeks. After a farmer has acquired loan, it'll be repaid in weekly installments. These installments are deducted from payments of the food company to the farmer over the course of next 13 weeks. Challenge: Given historical 1 year (52 weeks) of payment…
The dairy farmers provide milk to the food company. The food company in return makes weekly payments to the farmers which are routed through the bank. The bank is offering loans to the dairy farmers against these payments. Duration of loan is 13 weeks. After a farmer has acquired loan, it'll be repaid in weekly installments. These installments are deducted from payments of the food company to the farmer over the course of next 13 weeks. Challenge: Given historical 1 year (52 weeks) of payment data, you role as analyst is to identify Which farmers are most suitable to target for loan product X? In other words, which farmers are likely to have consistent payments in next 13 weeks? How much loan should be offered to each farmer?
-
E Crime File Management System
–
This project was made in JAVA EE for the web part and phone gap & web services in JAVA EE for the android part. Its main features were:
Online crime reporting
Crime statistics
Crime investigation details
Offline chat with police
Record management of Wanted Criminals
Record Management of Missing PersonsAndere Mitarbeiter:innen
Auszeichnungen/Preise
-
IMPRS-CS Scholarship
Max-Planck-Institut für Informatik
Fully funded graduate school merit-based scholarship of graduate school, by the International Max Planck Research School for Computer Science (IMPRS-CS) which is a graduate program jointly run by the Max Planck Institute for Informatics (MPI-INF), the Max Planck Institute for Software Systems (MPI-SWS), and the Computer Science Department at Saarland University. The IMPRS for Computer Science (IMPRS-CS) is highly research-oriented.
-
Facebook Pytorch Udacity Scholarship
Udacity
-
Double Gold Medalist
Comsats Institute of Technology, Islamabad
I secured the top position in Comsats Institute of Technology Islamabad campus, as well as across all other campuses hence I got Institute gold medal as well as campus gold medal.
Prüfungsergebnisse
-
IELTS(Academic)
Prüfungsergebnis: 8.0/9.0
Sprachen
-
English
-
-
German
-
-
Urdu
-
Erhaltene Empfehlungen
1 Person hat Shahrukh Khan empfohlen
Jetzt anmelden und ansehenWeitere Aktivitäten von Shahrukh Khan
-
We are hiring research scientists 🚨‼️ Feel free to send me a DM if you're interested (and have experience) in efficient architecture design and/or…
We are hiring research scientists 🚨‼️ Feel free to send me a DM if you're interested (and have experience) in efficient architecture design and/or…
Beliebt bei Shahrukh Khan
-
Introducing Picotron: a minimal educational repository for 4D parallelism, showcasing all the techniques behind LLM training Inspired by Andrej…
Introducing Picotron: a minimal educational repository for 4D parallelism, showcasing all the techniques behind LLM training Inspired by Andrej…
Beliebt bei Shahrukh Khan
-
BERT was completely forgotten by the media & AI hype but was probably the most influential model of all times (thanks Google)! Excited about…
BERT was completely forgotten by the media & AI hype but was probably the most influential model of all times (thanks Google)! Excited about…
Beliebt bei Shahrukh Khan
-
It doesn't seem to be a secret anymore that we are about to reach the limits of AI-capabilities by focusing only on models and model sizes. But this…
It doesn't seem to be a secret anymore that we are about to reach the limits of AI-capabilities by focusing only on models and model sizes. But this…
Beliebt bei Shahrukh Khan
-
Just 10 days after o1's public debut, we’re thrilled to unveil the open-source version of the groundbreaking technique behind its success: scaling…
Just 10 days after o1's public debut, we’re thrilled to unveil the open-source version of the groundbreaking technique behind its success: scaling…
Beliebt bei Shahrukh Khan
-
Llama 3.3 fine-tuning with ultra long context is here! 🦙 Unsloth now supports 89K context for Meta Llama 3.3 (70B) on a 80GB GPU - which is 13x…
Llama 3.3 fine-tuning with ultra long context is here! 🦙 Unsloth now supports 89K context for Meta Llama 3.3 (70B) on a 80GB GPU - which is 13x…
Beliebt bei Shahrukh Khan
-
🏆 We’re thrilled to announce that Benjamin Tuezen has been appointed as our new Chief Operating Officer! Over the past four years, Ben has been an…
🏆 We’re thrilled to announce that Benjamin Tuezen has been appointed as our new Chief Operating Officer! Over the past four years, Ben has been an…
Beliebt bei Shahrukh Khan
-
It has been a while since the first release of the GLiNER multi-task model. During that time we continuously researched different directions for…
It has been a while since the first release of the GLiNER multi-task model. During that time we continuously researched different directions for…
Beliebt bei Shahrukh Khan
-
Say goodbye to Bitcoin and all other fancy pre-quantum stuff. And ya, all your passwords are gonna get cracked as well For perspective : Google's…
Say goodbye to Bitcoin and all other fancy pre-quantum stuff. And ya, all your passwords are gonna get cracked as well For perspective : Google's…
Beliebt bei Shahrukh Khan
-
My latest project is the outcome of the last 2+ years working with TPUs from the amazing TPU Research Cloud program and training Encoder-only LMs…
My latest project is the outcome of the last 2+ years working with TPUs from the amazing TPU Research Cloud program and training Encoder-only LMs…
Beliebt bei Shahrukh Khan
-
They made me a GitHub star for FastAPI and other projects! 🤩✨ Who nominated me? (I have no idea 😅). Thank you! 😊 https://lnkd.in/dUTnDVSH
They made me a GitHub star for FastAPI and other projects! 🤩✨ Who nominated me? (I have no idea 😅). Thank you! 😊 https://lnkd.in/dUTnDVSH
Beliebt bei Shahrukh Khan
-
Neuigkeiten zum Nikolaustag! Wir haben ein besonderes Geschenk für alle, die auf der Suche nach einem leistungsstarken jedoch kleinen…
Neuigkeiten zum Nikolaustag! Wir haben ein besonderes Geschenk für alle, die auf der Suche nach einem leistungsstarken jedoch kleinen…
Beliebt bei Shahrukh Khan
-
I'm looking for an intern to introduce Sparse Embedding models to Sentence Transformers! If you're passionate about open source, interested in…
I'm looking for an intern to introduce Sparse Embedding models to Sentence Transformers! If you're passionate about open source, interested in…
Beliebt bei Shahrukh Khan
Weitere ähnliche Profile
Weitere Mitglieder namens Shahrukh Khan in Deutschland
-
Shahrukh Khan
-
Shahrukh Khan
-
Shahrukh Khan
Senior Product Designer
-
Shahrukh Khan
Studium an der Hochschule Hochschule für Angewandte Wissenschaften Hamburg (HAW Hamburg)
Es gibt auf LinkedIn 11 weitere Personen namens Shahrukh Khan, die sich in Deutschland befinden.
Weitere Mitglieder anzeigen, die Shahrukh Khan heißen