Ajay Halthor

Ajay Halthor

Los Angeles, California, United States
4K followers 500+ connections

About

Hi 👋 Your friendly neighborhood machine learning engineer Ajay here! I love learning and…

Activity

Join now to see all activity

Experience

  • GOAT Group Graphic
  • -

  • -

    Greater Los Angeles Area

  • -

    San Francisco, California

  • -

    India

  • -

    Mysuru Area, India

  • -

    Sydney, Australia

  • -

    Mysuru Area, India

  • -

    Mysuru Area, India

Education

Licenses & Certifications

Publications

  • A Kannada Speech to Text System

    Unpublished

    The objective of this paper is to demonstrate a Speech to Text system for Kannada, an Indian language. Using the Hidden Markov Model Toolkit (HTK), the system employs a Hidden Markov Model (HMM) based approach for speech to text conversion. Speech input as WAV files are converted into feature vectors in the form of Mel Frequency Cepstral Coefficients (MFCCs). Dictionary words are broken down into corresponding subwords: monophones, triphones or syllables; that are used to determine the text of…

    The objective of this paper is to demonstrate a Speech to Text system for Kannada, an Indian language. Using the Hidden Markov Model Toolkit (HTK), the system employs a Hidden Markov Model (HMM) based approach for speech to text conversion. Speech input as WAV files are converted into feature vectors in the form of Mel Frequency Cepstral Coefficients (MFCCs). Dictionary words are broken down into corresponding subwords: monophones, triphones or syllables; that are used to determine the text of the input speech wave form. We introduce a pure Kannada method of generating subword sequences that can be generalized for any Indian Language. Each subword is trained against dictionaries of 500 and 750 words with sample rates 16 kHz, 22.05 kHz, 44.1 kHz. Training against samples sentences of varying lengths, we obtained a 95.73% accuracy for single user isolated word recognition with a dictionary of 750 words and up to a 98.2% accuracy for a 500 word dictionary.

  • Prediction of Visual Perception from BOLD fMRI

    IEEE

    Presented at the 2nd IEEE International Conference on Communication Systems, Computing & IT Applications (CSCITA) at Mumbai, 2017.

    The goal of this paper is to determine the object a person visually perceives by analyzing BOLD fMRI data. We analyze the effects of univariate and multivariate feature selection techniques on Haxby's Dataset. I introduce a novel technique, multi slicing and compare it to existing techniques. We obtained a 93.16% accuracy: higher than the state of the art…

    Presented at the 2nd IEEE International Conference on Communication Systems, Computing & IT Applications (CSCITA) at Mumbai, 2017.

    The goal of this paper is to determine the object a person visually perceives by analyzing BOLD fMRI data. We analyze the effects of univariate and multivariate feature selection techniques on Haxby's Dataset. I introduce a novel technique, multi slicing and compare it to existing techniques. We obtained a 93.16% accuracy: higher than the state of the art 92%.

    Other authors
    See publication

Projects

  • Text translation and Speech Conversion system for Kannada

    This is an interdisciplinary project spanning Computer Vision and NLP. We Scan English text with a smartphone camera where it is then translated and converted to kannada speech. The translated text is displayed seamlessly in place of the original text with Augmented Reality.

    Other creators
  • Auto keyphrase generation

    I generate a list of “candidate phrases” and classify each phrase as ‘keyphrase’ or ‘not keyphrase’ based on co-occurrence of phrases and CTR (click through rate) with respect to Google Adwords. CTR helps us chose the phrases which will generate more traffic. Without Adwords, I get an 81% agreement with 20 modern key phrase taggers.

  • Text Summarization

    Creating a modification of Bayes Algorithm for summarising text. By retaining flow of ideas with paragraphing, my implementation accepts a more flexible text input than available online summary tools and delivers cogent summaries for general essays, reviews and publications

  • Television Network Database

    Developed a responsive website analogous to Dish TV, Time Warner by incorporating a login portal, transaction management system and a customer review platform.

    Other creators
  • Verification of Declaration Section of a C Program

    Compilers usually involve three basic steps - scanning, parsing and code generation. In this project, we verify the declaration section of a C program We have achieved our project objective by using Lex, Yacc and Java.

    LEX and YACC help to write programs that transform structured input. This includes an erroneous range of applications - anything from a simple text search program that looks for patterns in the input file to a C Compiler that transforms a source program into optimized…

    Compilers usually involve three basic steps - scanning, parsing and code generation. In this project, we verify the declaration section of a C program We have achieved our project objective by using Lex, Yacc and Java.

    LEX and YACC help to write programs that transform structured input. This includes an erroneous range of applications - anything from a simple text search program that looks for patterns in the input file to a C Compiler that transforms a source program into optimized object code.

    For a C program, the units are variable names, constants, strings, operators, punctuation and so forth. This division into units called tokens, is known as lexical analysis or lexing.

    LEX takes a set of descriptions of possible tokens and produces a C routine which we call a lexical analyzer or a lexer, or a scanner for short. The scanner then identifies those tokens. The set of descriptions you give to Lex is called a Lex specification. As the input is divided into tokens, program often needs to establish the relationship among the tokens. A C Compiler needs to find the expressions, statement, declarations, blocks, and procedures in the program. This task is known as parsing and the list of rules that defines the relationships that the program understands, is a grammar.

    Yacc takes a concise description of a grammar and produces a C routine that can parse that grammar. This Yacc parser automatically detects whenever a sequence of input tokens matches one of the rules in the grammar and takes some action as defined. It also detects a syntax error whenever its input does not match any of the rules.

    Thus when Lex and Yacc are used in conjunction with each other, the end result is a compiled source code, free from syntax errors.

    The GUI is designed using Java Swings. It interacts with the executable file that does all the heavy lifting and displays appropriate messages.

    Other creators
  • DFA to NFA Converter

    As a part of the Finite automaton course, built a Java program to convert a deterministic finite automaton to a non deterministic finite automaton.

  • Packet Path Traversal Generator

    Developed and implemented a novel algorithm to reduce the rate at which packets are dropped at every node in a network. Incorporated Fairness in a network and hence combatted congestion (using Java).

    Other creators
  • Passport Information System

    To externally modify passport information in an excel spreadsheet. Interfaced with the java API: Aspose Cells

  • Websites

    -

    1. Personal Website for Piano covers, blog posts and AMVs with WordPress
    2. TV service provider website with a customer review platform
    3. Teacher Appraisal Web Portal
    4. Business advertisement web platfrom for businesses across Iran

Languages

  • English

    -

Organizations

  • All India Institute of Speech and Hearning

    Developer

    -

    Earlier this year, I led a team that presented an interdisciplinary project, spanning Computer Vision and NLP : A text translation and speech conversion system for vernacular languages. This Hackathon competition was held at the All India Institute of Speech and Hearing (AIISH). The project targets conversion of English text to speech in Kannada: an indian language. Our goal is to eliminate language barriers and ensure that everyone has equal access to the world’s most advanced technologies. As…

    Earlier this year, I led a team that presented an interdisciplinary project, spanning Computer Vision and NLP : A text translation and speech conversion system for vernacular languages. This Hackathon competition was held at the All India Institute of Speech and Hearing (AIISH). The project targets conversion of English text to speech in Kannada: an indian language. Our goal is to eliminate language barriers and ensure that everyone has equal access to the world’s most advanced technologies. As one of the top proposals, we have the privilege of working with AIISH to convert this project into a marketable product.

  • Shazin Gostar

    Fullstack developer and Manager

    -

    Developed a business consumer interaction platform analogous to Yelp for businesses across Iran

  • Classe365

    Digital Evangelist

    Screencast tutorials, Blogging and creating a startup video

Recommendations received

More activity by Ajay

View Ajay’s full profile

  • See who you know in common
  • Get introduced
  • Contact Ajay directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Add new skills with these courses