About
Scaling AI, data and analytics is hard, John Hogue has over a decade of in-depth…
Articles by John
Activity
-
It was great to present to such a highly engaged board about the current student experience!
It was great to present to such a highly engaged board about the current student experience!
Shared by John Hogue
-
There is a new Meetup for Solution Architects starting! First meeting is this Wednesday and it's hosted by Improving. Hope I see you there.
There is a new Meetup for Solution Architects starting! First meeting is this Wednesday and it's hosted by Improving. Hope I see you there.
Liked by John Hogue
-
Got a question that needs answering? We'll bring the brains, you bring the data!
Got a question that needs answering? We'll bring the brains, you bring the data!
Shared by John Hogue
Experience
Education
Licenses & Certifications
Publications
-
Feature Engineering with PySpark
DataCamp
The real world is messy and your job is to make sense of it. Toy datasets like MTCars and Iris are the result of careful curation and cleaning, even so the data needs to be transformed for it to be useful for powerful machine learning algorithms to extract meaning, forecast, classify or cluster. This course will cover the gritty details that data scientists are spending 70-80% of their time on; data wrangling and feature engineering. With size of datasets now becoming ever larger, let's use…
The real world is messy and your job is to make sense of it. Toy datasets like MTCars and Iris are the result of careful curation and cleaning, even so the data needs to be transformed for it to be useful for powerful machine learning algorithms to extract meaning, forecast, classify or cluster. This course will cover the gritty details that data scientists are spending 70-80% of their time on; data wrangling and feature engineering. With size of datasets now becoming ever larger, let's use PySpark to cut this Big Data problem down to size!
Projects
-
Anomaly Detection A to Z
- Present
Behold! The heartbreaking! The hair-raising! Values too big or too small to believe! Hear about the messy science of anomaly detection! Walk through the nuances of different kinds of anomaly and outlier detection methods! And for a limited time only, you too can see how to highlight truly SHOCKING events!
-
Intro to Geospatial Data using Python
Data comes in all shapes and sizes and often government data is geospatial in nature. Often times data science programs & tutorials ignore how to work with this rich data to make room for more advanced topics. Our MinneMUDAC competition heavily utilized geospatial data but was processed to provide students a more familiar format. But as good scientists, we should use primary sources of information as often as possible.
Why use this Notebook?
Use this Notebook to get a basic…Data comes in all shapes and sizes and often government data is geospatial in nature. Often times data science programs & tutorials ignore how to work with this rich data to make room for more advanced topics. Our MinneMUDAC competition heavily utilized geospatial data but was processed to provide students a more familiar format. But as good scientists, we should use primary sources of information as often as possible.
Why use this Notebook?
Use this Notebook to get a basic understanding of how to read, write, query, perform geospatial calculations and join data sets together. Along the way you will see some tips to preprocessing data for analysis and some tricks to ensure you are computing efficiently. This Notebook is be focused on Minnesota Tax shapefiles, MetCouncil Water Features and MN PCA Lake Quality Attributes all of which were the focus of our Dive Into Water (Data) Competition. It is meant as a way to give you real data, real code and a real problem to work through.
Social Data Science hopes you take what you learn here and use it to improve the world around you! -
Natural Language Processing with PySpark
Ready to move beyond Word Count? Watch as John Hogue walks through a practical example of a data pipeline to feed textual data for tagging with PySpark and ML. Learn to leverage great existing Python libraries in Spark such as NLTK and how to use some of Spark’s newer features. A GitHub Repo of source code, training and test sets of data will be provided for attendees to explore and play with.
https://www.youtube.com/watch?v=AsW0QzbYVow
-
Pytrends: Google Trends Automation
Simple interface for automating downloads of csv reports from Google Trends.
Languages
-
Chinese
Professional working proficiency
-
English
Native or bilingual proficiency
Organizations
-
University of St Thomas
Strategic Advisory Board Member
- Present•Provide direction on industry and technology changes that impact Graduate Programs in Software. •Participate in the strategic planning process of Graduate Programs in Software.
-
MinneAnalytics
Board Member
- Present•Organized annual conferences such as Data Tech and Food, Ag, Sustainability, & Supply Chain, in Tech, bringing in 1000+ participants and dozens of industry speakers •Facilitate Judging for student data analytics competitions bringing in 250+ students from over 30 institutions and 300 analytic professionals to judge.
Recommendations received
8 people have recommended John
Join now to viewMore activity by John
-
Pretty excited to bring this public! For the past few months, Daniel McKenzie Jonathan Zderad Destiny Vorbeck and I have been putting pen to paper…
Pretty excited to bring this public! For the past few months, Daniel McKenzie Jonathan Zderad Destiny Vorbeck and I have been putting pen to paper…
Liked by John Hogue
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore MoreOthers named John Hogue in United States
-
John Hogue
Vice President - Global Commercial Credit & Underwriting at American Express
-
John Hogue
Senior Consultant at Unify Consulting
-
John Hogue
General Manager - Crane Systems - KY DESHAZO CRANE
-
John Hogue
Engineer focused on producing creative and unique ideas to solve engineering challenges of various sizes and complexities.
106 others named John Hogue in United States are on LinkedIn
See others named John Hogue