Uncle Data

Uncle Data

Technology, Information and Media

Newsletter and podcast episodes to cover all things related to data.

About us

Covering data related matters with blog posts and podcasts with data people.

Industry
Technology, Information and Media
Company size
1 employee
Headquarters
Vilnius
Type
Self-Employed

Locations

Updates

  • I might be late to the train, but have you heard about SQLGlot? It's an open-source library that parses SQL (https://lnkd.in/dqgdS53j); it's used in many commercial applications (dbt cloud included for column-level lineage), SQLMesh uses it, and many more usages. I started to play with it more actively at work since we have a lot of SQL code, and we're looking to enhance our current data lineage solution, which is built in-house. "First I was afraid, I was petrified" 🎵 , but after a quick try, I started to get more into it, and oh boy, how fun it is to use it. It's based on generators (some people would have to adjust their habits working with already computed lists). It works better on very complicated queries compared to our test with the OpenLineage parser. I have a feeling that it's going to become a blog post about it. --- Stay in the loop for the latest insights from Uncle Data! * Click the notification bell icon 🔔 to receive updates on Linkedin * Subscribe to my Newsletter on Substack ✏ * Listen to Uncle Data Podcast ⌨️ Listen. Learn. Connect --- #sql #parsing #sqlglot #lineage

    GitHub - tobymao/sqlglot: Python SQL Parser and Transpiler

    GitHub - tobymao/sqlglot: Python SQL Parser and Transpiler

    github.com

  • This week on Uncle Data. Haha, sounds like a good recap of what happened in the last episode of your favourite TV series. But actually it's a bit of that. If you've been following me you might have seen that I ran a series on different data roles and how I perceived them, now comes a long awaited cross over between them, how I see them interconnect and how can you move from one to another. Keep in mind that it's completely based on my experiences and some of them might be just biased of how things are working here in Lithuania. I'm super excited to hear your thoughts on different interactions and roles. Feel free to DM or comment your thoughts! ---- Stay in the loop for the latest insights from Uncle Data! * Click the notification bell icon 🔔 to receive updates on Linkedin * Subscribe to my Newsletter on Substack ✏ * Listen to Uncle Data Podcast ⌨️ Listen. Learn. Connect ---- #data #career #growth #uncledata

    Into the Data Role Multiverse

    Into the Data Role Multiverse

    uncledata.substack.com

  • Certifications and bootcamps. I've spent quite a while hiring people throughout my career, but before being a manager, I was also an Individual Contributor (like I am now!). I was pushed to take some exams and get certified; some seemed like nice pieces of showing off, which was fun and exciting for me! The interesting thing is that when I became a manager, my scepticism about those exams and certifications rose relatively high. Not to be too harsh, but most seemed like vanity certificates (they say precisely nothing about your skill level). You see a guy with 10+ certifications ask a simple question, and he can't answer it, even though he's certified. Jobs in the data world became highly paid, creating a surge in boot camps that will prepare you for the job. Sure, if you put in much effort and are curious about what you do, you'll be fine. Curiosity and constant learning make you stand out in the data world. Getting the basics makes sense, but add more effort if you want to shine and get hired faster (or even for more money). Show your skills through examples! Basically, this post on my substack is my thoughts about these topics. I'm interested to hear your thoughts. Is it just me seeing it like this? --- Stay in the loop for the latest insights from Uncle Data! * Click the notification bell icon 🔔 to receive updates on Linkedin * Subscribe to my Newsletter on Substack ✏ * Listen to Uncle Data Podcast ⌨️ Listen. Learn. Connect ---

    Why is motivation important in the Data World?

    Why is motivation important in the Data World?

    uncledata.substack.com

  • Tips on dictionaries! Personal preference not to use "update" option, since it's an in place operation and I hate mutability 😅 #python #data #tip #uncledata

    View profile for Tomas Peluritis, graphic

    Data Person | Data Engineering, Technical Leadership | Uncle Data

    When it comes to dictionary updates, there used to be two options: the not pretty, but reliable (The Ol’ reliable) and the mutable one (that updates in place). From Python 3.9, one more was introduced that does the same as The Ol’ Reliable, but easier to understand and fewer characters. You might wonder why I’m bashing the in-place updating one - it’s simple. It makes your code more complicated to understand because it can update your dictionary somewhere, and you’ll have to scratch your head to figure out where. Using unpacking or “|” makes it easier to understand and debug, and no magical updates happen! ---- Stay in the loop for the latest insights from Uncle Data! * Click the notification bell icon 🔔 to receive updates on Linkedin * Subscribe to my Newsletter on Substack ✏ * Listen to Uncle Data Podcast ⌨️ Listen. Learn. Connect ---- #uncledata #python #inplaceupdate #dictionary

  • Being lazy is a blessing and a curse at the same time. You can think of exciting and fancy solutions; iteration is fast, but you most likely will leave some overhead just because you're lazy! This time, I'm sharing some of my findings while still learning the ropes of rust. ---- Stay in the loop for the latest insights from Uncle Data! * Click the notification bell icon 🔔 to receive updates on Linkedin * Subscribe to my Newsletter on Substack ✏ * Listen to Uncle Data Podcast ⌨️ Listen. Learn. Connect ---- #uncledata #rust #python #learning

    View profile for Tomas Peluritis, graphic

    Data Person | Data Engineering, Technical Leadership | Uncle Data

    I’m lazy. While coding in Python, I tend to create a bunch of variables along the way that I’m not using, leaving a big mess. While I’m still learning the ropes of Rust, it’s kicking me for leaving unused stuff. If a variable is not used, you have to remove it, or you’ll have to look for those annoying warnings. I’m the type of person who doesn’t like warnings in my code, so naturally, I’m cleaning it up. That also makes your code cleaner and easier to read! Here is an image of my attempt to complete Advent of Code after it ended in Rust. One day at a time! ---- Stay in the loop for the latest insights from Uncle Data! * Click the notification bell icon 🔔 to receive updates on Linkedin * Subscribe to my Newsletter on Substack ✏ * Listen to Uncle Data Podcast ⌨️ Listen. Learn. Connect ---- #uncledata #rust #python

    • No alternative text description for this image

Similar pages