Arpit Bhayani’s Post


60% of the tables I have recently designed do not have the standard auto-increment ID column. Although counter-intuitive, doing this has given me a significant performance boost because of smaller table sizes (data plus index), particularly in cases where we paginate more than we do pointed reads and updates. It required me to drop the ORM, and I happily took that tradeoff for better query control and performance. This design choice was made only after understanding how a relational database (MySQL in my case) stores data and evaluates queries; it is definitely not a one-size-fits-all decision. So, whenever you are designing a schema, always optimize for the most frequent queries; even if that requires a composite primary key, it may be worth it. Also, knowing a bit about database internals always helps. #AsliEngineering #Databases #SystemDesign
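A minimal sketch of the idea, using SQLite as a stand-in (the post is about MySQL/InnoDB; the `feed_items` table and column names are hypothetical). The composite primary key matches the most frequent query (paginating one user's items), and `WITHOUT ROWID` makes SQLite cluster rows on that key, roughly the way InnoDB clusters rows on the primary key:

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical feed table with no auto-increment id: the composite
# primary key (user_id, created_at) is exactly the access pattern we
# paginate on, and WITHOUT ROWID clusters the rows on that key.
conn.execute("""
    CREATE TABLE feed_items (
        user_id    INTEGER NOT NULL,
        created_at INTEGER NOT NULL,
        payload    TEXT,
        PRIMARY KEY (user_id, created_at)
    ) WITHOUT ROWID
""")

conn.executemany(
    "INSERT INTO feed_items VALUES (?, ?, ?)",
    [(1, t, f"item-{t}") for t in range(1, 101)],
)

# Keyset pagination: seek directly into the clustered index instead of
# OFFSET-scanning, so each page is a cheap range read.
def next_page(user_id, after_ts, size=10):
    return conn.execute(
        "SELECT created_at, payload FROM feed_items"
        " WHERE user_id = ? AND created_at > ?"
        " ORDER BY created_at LIMIT ?",
        (user_id, after_ts, size),
    ).fetchall()

first_page = next_page(1, 0)
print(first_page[0], first_page[-1])  # (1, 'item-1') (10, 'item-10')
```

With no surrogate id column, every page fetch is a single range scan over the clustered key; the tradeoff, as the comments below note, is wider secondary indexes and more hand-written SQL.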

Hitesh Garg

Curious • Senior Software Engineer • 19k+ • Python • Django • Flask • AWS • SQL • Support-turned-Dev

4d

If you don't use the id column then how do you use foreign keys?

Syed Shamail

Fullstack JavaScript Engineer with a focus on Microservices, Multicloud, and DevOps

4d

It would be really good if you could share the use case in which you opted for a non-auto-increment ID column.

Ayush Gupta

EPAM | Ex-FIS | NIT Hamirpur | Author at C#Corner | Certified Azure Developer Associate | Tutor

4d

You still require a unique ID, either a GUID or a computed key. Interestingly, I will try your case to see the performance.

Arpan Mukherjee

Eng Lead - Founding Team @ Reconect.ai

4d

Composite PKs are more useful than they seem. I have been using (tenant_id, id) style keys on Postgres (so the id can easily be a UUID), since we always read/write data for a single tenant only. This keeps options open like partitioning on tenant_id, or even sharding on it (using something like Citus or our own shard-router code). It almost eliminates the noisy-neighbour issue, at least from a query-plan perspective, since all indexes essentially contain tenant_id.

Evan Howlett

Senior Fullstack Software Engineer

4d

I've been working in the opposite direction. The data (tires) I work with is very hierarchical -- you have a brand, a model, and then the size in that model. The old data model has an auto-incrementing id for brand, but model was set up to have a dependent, incremental id based on the brand id, making it a composite key. Generally, the sizes can be identified by a manufacturer's number, but because there are "white label" tires that get sub-branded, it's not a unique identifier. Thus, the composite key of brand id, line id, and manufacturer's number is the primary key. In the new model, I still have the old keys, which I keep as unique keys; however, I've moved the primary key to be a single auto-incrementing id column. You keep the prevented-redundancy guarantee, have the ability to query on either key set (allowing queries to be less complex in some scenarios), get better insertion/deletion performance due to the linear nature of the auto-incrementing id, and only pay the cost of an extra few bytes per record. Really, it's all about knowing the data. Not every dataset is best modeled with a single-column primary key. Not every dataset is best modeled with a composite primary key.
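That surrogate-key-plus-unique-constraint pattern can be sketched like this (SQLite stand-in; the `tire_sizes` table and column names are hypothetical). The single auto-incrementing id becomes the primary key, while the old composite business key survives as a UNIQUE constraint, so the redundancy guarantee is preserved:

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Surrogate primary key plus the old composite key as a UNIQUE
# constraint: narrow clustered key, business-key uniqueness intact.
conn.execute("""
    CREATE TABLE tire_sizes (
        id       INTEGER PRIMARY KEY AUTOINCREMENT,
        brand_id INTEGER NOT NULL,
        line_id  INTEGER NOT NULL,
        mfr_num  TEXT    NOT NULL,
        UNIQUE (brand_id, line_id, mfr_num)
    )
""")

conn.execute(
    "INSERT INTO tire_sizes (brand_id, line_id, mfr_num)"
    " VALUES (1, 1, 'A100')"
)

# The UNIQUE constraint still rejects a duplicate business key,
# even though it is no longer the primary key.
try:
    conn.execute(
        "INSERT INTO tire_sizes (brand_id, line_id, mfr_num)"
        " VALUES (1, 1, 'A100')"
    )
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

print(duplicate_rejected)  # True
```

Foreign keys can then reference the compact `id` instead of the three-column business key, which is the usual argument for this layout.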

Vishal Jaiswal, PMP®

Data Architecting | Leading data strategy and innovative database solutions | Data Governance | Data Management | Cloud Data Architect | Data Modeler | Mentor | Trained 5000+ professionals in Data field

3d

Perfectly said Arpit Bhayani. Designing the database with a forecast of how much data, and what kind of data, will come always helps you design the table structure properly, but as you said, one design does not fit all. Surrogate key (auto-increment, no business value) versus composite primary key is always a debatable topic. If a table with a composite primary key is expected to have millions of rows, the index behind that composite key can grow to a point where CRUD performance degrades badly. In that case, it is much better to use a simple integer ID primary key, whose index will be compact enough, and establish the necessary database constraints to maintain uniqueness. Follow Vishal Jaiswal, PMP® for learning database concepts, building tricks, and cracking database technical interviews.

Javed Akeeb

frontend developer | React.js | Next.js | AI Automation | UI/UX

4d

Many devs go for ORMs by default. But what I have experienced, even as a front-end dev, is that the queries get incrementally slower as the size of the database increases.

Archishman Sengupta

SDE - 1 Fullstack @StackWealth (YC S21) | ICPC Regionalist | 3x YC dev

4d

But a CPK can result in larger index sizes, since composite indexes can be bigger than single-column indexes, which also increases storage and I/O ops, especially for write-heavy workloads. How do you handle this complexity and the potential performance issues?

Subham Tripathi

Senior Software Engineer @ Carrefour | Ex- Goldman Sachs

4d

100% agree with you here; this is something I too discovered while designing for tightly related pointed queries.

