Suvarna Narayanan Baratharaj’s Post

Developer | home.alles-tools.com

2mo

A Postgres data dump of over 3 million news articles from Indian news providers. I aggregated these to build a semantic search website with analytics such as title tone analysis and the ability to track how news evolved around a given context. Sadly, life got busier; I will come back to this later, but feel free to use this dataset to make something of your own. A lot more can be done with this dataset if you are more data-savvy :) Dataset link: https://lnkd.in/gxAjRbXi I also have a slightly chonkier dump of 3.3 million records from more news providers on my VPS; send a DM to request it. #datadump #datascience #data

GitHub - SuvarnaNarayanan/Indian-News-Articles: A dump of Indian news articles from the hindu and times of india.

github.com


More Relevant Posts

Nagendra Varma P.

Data Analyst | Data Specialist
9mo Edited
5 SQL Tricks to clean your data

Title: 𝗦𝗵𝗶𝘃𝗮𝗻 𝗸𝘂𝗺𝗮𝗿 || 𝗦𝗲𝗻𝗶𝗼𝗿 𝗗𝗮𝘁𝗮 ...

google.com
Adarsh Shiragannavar

Generative AI | SaaS | PaaS | Real-Time Apps | Customer Experience | AI Consulting
2mo
Following our Notion clone tutorial? Check out the 2nd post from Valeri Karpov (the creator of Mongoose) as he demonstrates the power of RAG and Astra DB when building an open-source clone of Notion. #DataStax #VectorDB #RAGApplications https://ow.ly/SwCJ50TBqAI
Sadan Subedi

Programmer|| Tech Fanatic||CS-Student
2mo
Populate method of Mongoose 😍 Scenario: whenever I clicked the add-to-cart button, I passed the respective product _id. Initially I was thinking of accessing the details of that specific product _id with a fetch method, but later I learned about the populate method in Mongoose. Here I try to explain how we can implement it.

2 Comments
Icinga GmbH

1,683 followers
3mo
Overusing dictionaries in monitoring can lead to complex, cluttered configs! A recent blog post from Ravi shows how to clone dictionary row entries for objects from import sources in Icinga Director. This guide will help you improve your configurations: https://lnkd.in/eVbZWmaD #IcingaDirector

Icinga Director: Cloning dictionary row entries for objects from import sources

https://icinga.com

1 Comment
Eduardo Bellani

Hands-on technical leader -- helping companies with strategic software assets to achieve their goals by developing software correctly and on time, either individually or by building and leading high-performance teams.
5mo
On the value of semantics: https://lnkd.in/dsRTMerT

PostgreSQL subtransactions, savepoints, and exception blocks

franckpachot.medium.com
Hiren Parkar

Data Analyst | Data Engineer | Python, SQL, Excel, Power BI | MSc IT, Vidyalankar School of IT
3mo Edited
Excited to Embark on My Data Analytics Journey!

After completing my MSc in Information Technology from Vidyalankar School of Information Technology, I am thrilled to start my career as a Data Analyst. During my studies, I developed a deep passion for data—how it can drive business decisions, uncover hidden insights, and solve real-world problems.

Through hands-on projects, I’ve sharpened my skills in Python, SQL, and data visualization, and I’ve been applying machine learning techniques to solve analytical challenges. You can explore my projects on GitHub.

✨ A few highlights from my portfolio:
- Developed a customer churn prediction model using machine learning to improve retention strategies.
- Built an interactive sales dashboard in Power BI, delivering actionable insights for business decisions.
- Conducted exploratory data analysis on a retail dataset, identifying key sales trends to optimize future strategies.

Though I am early in my career, I am continuously expanding my knowledge and adding more projects to my portfolio. Feel free to check out my work here: github.com/hirenparkar

I am excited to connect with industry experts, exchange knowledge, and explore new opportunities in the world of data analytics. If you're seeking a motivated data enthusiast ready to dive into meaningful analytics, I’d love to connect!

#DataAnalytics #MScIT #DataScience #Python #SQL #PowerBI #MachineLearning #DataVisualization

hirenparkar - Overview

github.com

21 Comments
Slava Zagriichuk

Data Science | Machine Learning | Data Engineering - I make data a useful tool.
4w
Recently, I tried to figure out how to work with the Finnish Statistics API. It turned out not to be difficult; however, I noticed that it is impossible to get the types of aggregation (for example, district divisions such as areas, municipalities, city/countryside, and others) without using the visual interface. Essentially this is not a big problem, as I could fetch them from the website and then hardcode them. However, I wrote to the developers about my finding and got the answer that the problem really exists and cannot be fixed with the current architecture of the API. They are going to release API v2 with better functionality in early 2025. Well, this is great news, I'll wait! For now, I have written a piece of code containing examples of requests to the current API that work well. Here it is: https://lnkd.in/drrncQxc

GitHub - slava-zagriichuk/StatFin: Studies over Finnish statistics database (StatFin)

github.com
Amanda Allan

Manager, Cloudera Executive Briefing Program
6mo
Check out the latest blog from Venkat Rajaji - what went down in the news this week and what it can mean for your org https://lnkd.in/dEvTaUgE

Databricks Follows Cloudera by Adopting Iceberg, While Snowflake Mulls Open Source Approach - Cloudera Blog

https://blog.cloudera.com
Manjupriya selvam

Artificial intelligence and data science
7mo
https://lnkd.in/gGPS78Af K-means clustering is a popular algorithm used for grouping similar data points together. It's often used in data analysis and machine learning. With K-means clustering, you can partition a dataset into K distinct clusters based on their similarities. The algorithm iteratively assigns data points to the nearest cluster centroid and updates the centroids until convergence. It's a powerful technique for finding patterns and structures in data. Let me know if you'd like more details or have any specific questions! 😊📊
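The assign-and-update loop described in the post can be sketched in plain Python. This is a minimal illustration, not the code from the linked repository: the function name `kmeans`, its parameters, and the choice to initialise centroids by sampling data points are all assumptions made for the example.

```python
import random

def kmeans(points, k, max_iters=100, seed=0):
    """Minimal K-means sketch (Lloyd's algorithm). Returns (centroids, labels)."""
    rng = random.Random(seed)
    # Assumption: initialise centroids by sampling k distinct data points.
    centroids = [tuple(p) for p in rng.sample(points, k)]
    labels = [None] * len(points)
    for _ in range(max_iters):
        # Assignment step: each point joins the cluster of its nearest centroid
        # (squared Euclidean distance).
        new_labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])))
            for p in points
        ]
        if new_labels == labels:
            break  # Converged: no point changed cluster.
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # Leave an empty cluster's centroid where it is.
                centroids[c] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return centroids, labels

# Usage: two well-separated groups of 2-D points.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
centroids, labels = kmeans(points, k=2)
print(centroids, labels)
```

The result depends on the initial centroids, so practical implementations usually run several random initialisations or use a smarter seeding scheme.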

GitHub - Smanjupriya/prodigy_ML_02: K-means clustering

github.com
LlamaIndex

227,246 followers
6mo
3 Forms of Query Rewriting for RAG ✍️

Good RAG requires good retrieval, and good retrieval requires a good query understanding layer. This is a comprehensive resource by `zhaozhiming` showing you 3 key patterns for adding a query rewriting layer for better question-handling for your RAG pipelines 🔥

1. Sub-question decomposition: Break a complex question into sub-questions. Unlike pure chain of thought, you can break a question down into parallelizable sub-questions that you can try answering all at once.
2. HyDE: rewrite the question to hallucinate an answer that better aligns with the embedding semantics.
3. Step-back prompting (from scratch ✨): To answer a complex question, take a “step back” and answer a more generic question to better answer the specific one.

Blog: https://lnkd.in/gj4Va86w
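The first pattern, sub-question decomposition with parallel answering, might look roughly like the sketch below. It is not taken from the linked blog or from the LlamaIndex API: `decompose`, `answer_with_subquestions`, the prompt wording, and the `llm` and `answer_one` callables are all hypothetical stand-ins (any function that maps a prompt string to a reply string would do).

```python
from concurrent.futures import ThreadPoolExecutor

def decompose(question, llm):
    """Ask the model for sub-questions, one per line (prompt wording is an assumption)."""
    reply = llm(f"Break this question into simpler sub-questions, one per line:\n{question}")
    # Drop blank lines and any leading "- " bullet markers.
    return [line.strip("- ").strip() for line in reply.splitlines() if line.strip()]

def answer_with_subquestions(question, llm, answer_one):
    """Answer the sub-questions concurrently, then synthesise a final answer."""
    subs = decompose(question, llm)
    # The sub-questions are independent, so they can be answered all at once.
    with ThreadPoolExecutor() as pool:
        answers = list(pool.map(answer_one, subs))
    findings = "\n".join(f"Q: {q}\nA: {a}" for q, a in zip(subs, answers))
    return llm(f"Using these findings:\n{findings}\n\nAnswer: {question}")
```

In a real pipeline, `answer_one` would typically run retrieval plus generation for a single sub-question; threads are used here only because such calls are usually I/O-bound.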
8 Comments
