John Hogue

John Hogue

Minneapolis, Minnesota, United States
6K followers 500+ connections

About

Scaling AI, data and analytics is hard, John Hogue has over a decade of in-depth…

Articles by John

Activity

Join now to see all activity

Experience

  • Rogue Hogue, LLC Graphic

    Rogue Hogue, LLC

    Minneapolis, Minnesota, United States

  • -

    Greater Minneapolis-St. Paul Area

  • -

    St Paul, Minnesota, United States

  • -

  • -

    Minneapolis, Minnesota

  • -

    Greater Minneapolis-St. Paul Area

  • -

    Greater Minneapolis-St. Paul Area

  • -

    Greater Minneapolis-St. Paul Area

  • -

    Golden Valley, MN

  • -

    Richfield, MN

  • -

    Eden Praire

  • -

    90 S 7th St. Suite 5300 Minneapolis, MN 55402

  • -

  • -

  • -

    Shanghai City, China

Education

  • University of St. Thomas Graphic
  • -

    Activities and Societies: Founded Campus Wargamers -student group for tabletop strategy games.

Licenses & Certifications

Publications

  • Feature Engineering with PySpark

    DataCamp

    The real world is messy and your job is to make sense of it. Toy datasets like MTCars and Iris are the result of careful curation and cleaning, even so the data needs to be transformed for it to be useful for powerful machine learning algorithms to extract meaning, forecast, classify or cluster. This course will cover the gritty details that data scientists are spending 70-80% of their time on; data wrangling and feature engineering. With size of datasets now becoming ever larger, let's use…

    The real world is messy and your job is to make sense of it. Toy datasets like MTCars and Iris are the result of careful curation and cleaning, even so the data needs to be transformed for it to be useful for powerful machine learning algorithms to extract meaning, forecast, classify or cluster. This course will cover the gritty details that data scientists are spending 70-80% of their time on; data wrangling and feature engineering. With size of datasets now becoming ever larger, let's use PySpark to cut this Big Data problem down to size!

    See publication

Projects

  • Anomaly Detection A to Z

    - Present

    Behold! The heartbreaking! The hair-raising! Values too big or too small to believe! Hear about the messy science of anomaly detection! Walk through the nuances of different kinds of anomaly and outlier detection methods! And for a limited time only, you too can see how to highlight truly SHOCKING events!

    See project
  • Intro to Geospatial Data using Python

    Data comes in all shapes and sizes and often government data is geospatial in nature. Often times data science programs & tutorials ignore how to work with this rich data to make room for more advanced topics. Our MinneMUDAC competition heavily utilized geospatial data but was processed to provide students a more familiar format. But as good scientists, we should use primary sources of information as often as possible.

    Why use this Notebook?

    Use this Notebook to get a basic…

    Data comes in all shapes and sizes and often government data is geospatial in nature. Often times data science programs & tutorials ignore how to work with this rich data to make room for more advanced topics. Our MinneMUDAC competition heavily utilized geospatial data but was processed to provide students a more familiar format. But as good scientists, we should use primary sources of information as often as possible.

    Why use this Notebook?

    Use this Notebook to get a basic understanding of how to read, write, query, perform geospatial calculations and join data sets together. Along the way you will see some tips to preprocessing data for analysis and some tricks to ensure you are computing efficiently. This Notebook is be focused on Minnesota Tax shapefiles, MetCouncil Water Features and MN PCA Lake Quality Attributes all of which were the focus of our Dive Into Water (Data) Competition. It is meant as a way to give you real data, real code and a real problem to work through.

    Social Data Science hopes you take what you learn here and use it to improve the world around you!

    See project
  • Natural Language Processing with PySpark

    Ready to move beyond Word Count? Watch as John Hogue walks through a practical example of a data pipeline to feed textual data for tagging with PySpark and ML. Learn to leverage great existing Python libraries in Spark such as NLTK and how to use some of Spark’s newer features. A GitHub Repo of source code, training and test sets of data will be provided for attendees to explore and play with.

    https://www.youtube.com/watch?v=AsW0QzbYVow

    See project
  • Pytrends: Google Trends Automation

    Simple interface for automating downloads of csv reports from Google Trends.

    See project

Languages

  • Chinese

    Professional working proficiency

  • English

    Native or bilingual proficiency

Organizations

  • University of St Thomas

    Strategic Advisory Board Member

    - Present

    •Provide direction on industry and technology changes that impact Graduate Programs in Software. •Participate in the strategic planning process of Graduate Programs in Software.

  • MinneAnalytics

    Board Member

    - Present

    •Organized annual conferences such as Data Tech and Food, Ag, Sustainability, & Supply Chain, in Tech, bringing in 1000+ participants and dozens of industry speakers •Facilitate Judging for student data analytics competitions bringing in 250+ students from over 30 institutions and 300 analytic professionals to judge.

Recommendations received

More activity by John

View John’s full profile

  • See who you know in common
  • Get introduced
  • Contact John directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Others named John Hogue in United States