Database Design and Relational Theory: Normal Forms and All That Jazz

Ebook726 pages7 hours

Database Design and Relational Theory: Normal Forms and All That Jazz

Name: Database Design and Relational Theory: Normal Forms and All That Jazz
Brand: Apress
Rating: 4.0 (2 reviews)

By C. J. Date

Rating: 4 out of 5 stars

4/5

()

Read preview

About this ebook

Create database designs that scale, meet business requirements, and inherently work toward keeping your data structured and usable in the face of changing business models and software systems.
This book is about database design theory. Design theory is the scientific foundation for database design, just as the relational model is the scientific foundation for database technology in general. Databases lie at the heart of so much of what we do in the computing world that negative impacts of poor design can be extraordinarily widespread.

This second edition includes greatly expanded coverage of exotic and little understood normal forms such as: essential tuple normal form (ETNF), redundancy free normal form (RFNF), superkey normal form (SKNF), sixth normal form (6NF), and domain key normal form (DKNF). Also included are new appendixes, including one that provides an in-depth look into the crucial notion of data consistency.Sequencing of topics has been improved, and many explanations and examples have been rewritten and clarified based upon the author’s teaching of the content in instructor-led courses.
This book aims to be different from other books on design by bridging the gap between the theory of design and the practice of design. The book explains theory in a way that practitioners should be able to understand, and it explains why that theory is of considerable practical importance. Reading this book provides you with an important theoretical grounding on which to do the practical work of database design. Reading the book also helps you in going to and understanding the more academic texts as you build your base of knowledge and expertise. Anyone with a professional interest in database design can benefit from using this book as a stepping-stone toward a more rigorous design approach and more lasting database models.

What You Will Learn

Understand what design theory is and is not
Be aware of the two different goals of normalization
Know which normal forms are truly significant
Apply design theory in practice
Be familiar with techniques for dealing with redundancy
Understand what consistency is and why it is crucially important

Who This Book Is For
Those having a professional interest in database design, including data and database administrators; educators and students specializing in database matters; information modelers and database designers; DBMS designers, implementers, and other database vendor personnel; and database consultants. The book is product independent.

Skip carousel

Databases

LanguageEnglish

PublisherApress

Release dateDec 12, 2019

ISBN9781484255407

Author

C. J. Date

C. J. Date has a unique stature in the database industry. Author or coauthor of well over 30 books on database management (including the bestselling An Introduction to Database Systems, currently in its 8th edition), he enjoys a reputation that’s second to none for his ability to explain complex technical issues in a clear and understandable fashion. He was inducted into the Computing Industry Hall of Fame in 2004.

Related authors

Skip carousel

Related to Database Design and Relational Theory

Related ebooks

Skip carousel

Beginning Oracle Database 12c Administration: From Novice to Professional
Ebook
Beginning Oracle Database 12c Administration: From Novice to Professional
byIgnatius Fernandez
Rating: 0 out of 5 stars
0 ratings
Oracle DBA Mentor: Succeeding as an Oracle Database Administrator
Ebook
Oracle DBA Mentor: Succeeding as an Oracle Database Administrator
byBrian Peasland
Rating: 0 out of 5 stars
0 ratings
Data Teams: A Unified Management Model for Successful Data-Focused Teams
Ebook
Data Teams: A Unified Management Model for Successful Data-Focused Teams
byJesse Anderson
Rating: 0 out of 5 stars
0 ratings
Exploring C++20: The Programmer's Introduction to C++
Ebook
Exploring C++20: The Programmer's Introduction to C++
byRay Lischner
Rating: 0 out of 5 stars
0 ratings
SQL Server 2019 Revealed: Including Big Data Clusters and Machine Learning
Ebook
SQL Server 2019 Revealed: Including Big Data Clusters and Machine Learning
byBob Ward
Rating: 0 out of 5 stars
0 ratings
Complete Guide to Test Automation: Techniques, Practices, and Patterns for Building and Maintaining Effective Software Projects
Ebook
Complete Guide to Test Automation: Techniques, Practices, and Patterns for Building and Maintaining Effective Software Projects
byArnon Axelrod
Rating: 0 out of 5 stars
0 ratings
Scalable Big Data Architecture: A practitioners guide to choosing relevant Big Data architecture
Ebook
Scalable Big Data Architecture: A practitioners guide to choosing relevant Big Data architecture
byBahaaldine Azarmi
Rating: 2 out of 5 stars
2/5
The SQL Server DBA’s Guide to Docker Containers: Agile Deployment without Infrastructure Lock-in
Ebook
The SQL Server DBA’s Guide to Docker Containers: Agile Deployment without Infrastructure Lock-in
byEdwin M Sarmiento
Rating: 0 out of 5 stars
0 ratings
Assessing and Improving Prediction and Classification: Theory and Algorithms in C++
Ebook
Assessing and Improving Prediction and Classification: Theory and Algorithms in C++
byTimothy Masters
Rating: 0 out of 5 stars
0 ratings
Pro Machine Learning Algorithms: A Hands-On Approach to Implementing Algorithms in Python and R
Ebook
Pro Machine Learning Algorithms: A Hands-On Approach to Implementing Algorithms in Python and R
byV Kishore Ayyadevara
Rating: 0 out of 5 stars
0 ratings
Numerical Python: Scientific Computing and Data Science Applications with Numpy, SciPy and Matplotlib
Ebook
Numerical Python: Scientific Computing and Data Science Applications with Numpy, SciPy and Matplotlib
byRobert Johansson
Rating: 0 out of 5 stars
0 ratings
Developing Applications with Azure Active Directory: Principles of Authentication and Authorization for Architects and Developers
Ebook
Developing Applications with Azure Active Directory: Principles of Authentication and Authorization for Architects and Developers
byManas Mayank
Rating: 0 out of 5 stars
0 ratings
Applied Natural Language Processing with Python: Implementing Machine Learning and Deep Learning Algorithms for Natural Language Processing
Ebook
Applied Natural Language Processing with Python: Implementing Machine Learning and Deep Learning Algorithms for Natural Language Processing
byTaweh Beysolow II
Rating: 0 out of 5 stars
0 ratings
MongoDB Recipes: With Data Modeling and Query Building Strategies
Ebook
MongoDB Recipes: With Data Modeling and Query Building Strategies
bySubhashini Chellappan
Rating: 0 out of 5 stars
0 ratings
Data Modeling Fundamentals: A Practical Guide for IT Professionals
Ebook
Data Modeling Fundamentals: A Practical Guide for IT Professionals
byPaulraj Ponniah
Rating: 0 out of 5 stars
0 ratings
Understanding Azure Data Factory: Operationalizing Big Data and Advanced Analytics Solutions
Ebook
Understanding Azure Data Factory: Operationalizing Big Data and Advanced Analytics Solutions
bySudhir Rawat
Rating: 0 out of 5 stars
0 ratings
Web App Development and Real-Time Web Analytics with Python: Develop and Integrate Machine Learning Algorithms into Web Apps
Ebook
Web App Development and Real-Time Web Analytics with Python: Develop and Integrate Machine Learning Algorithms into Web Apps
byTshepo Chris Nokeri
Rating: 0 out of 5 stars
0 ratings
Architecting CSS: The Programmer’s Guide to Effective Style Sheets
Ebook
Architecting CSS: The Programmer’s Guide to Effective Style Sheets
byMartine Dowden
Rating: 0 out of 5 stars
0 ratings
Practical API Architecture and Development with Azure and AWS: Design and Implementation of APIs for the Cloud
Ebook
Practical API Architecture and Development with Azure and AWS: Design and Implementation of APIs for the Cloud
byThurupathan Vijayakumar
Rating: 0 out of 5 stars
0 ratings
Data Engineering with Apache Spark, Delta Lake, and Lakehouse: Create scalable pipelines that ingest, curate, and aggregate complex data in a timely and secure way
Ebook
Data Engineering with Apache Spark, Delta Lake, and Lakehouse: Create scalable pipelines that ingest, curate, and aggregate complex data in a timely and secure way
byManoj Kukreja
Rating: 0 out of 5 stars
0 ratings
Generating a New Reality: From Autoencoders and Adversarial Networks to Deepfakes
Ebook
Generating a New Reality: From Autoencoders and Adversarial Networks to Deepfakes
byMicheal Lanham
Rating: 0 out of 5 stars
0 ratings
Deep Belief Nets in C++ and CUDA C: Volume 1: Restricted Boltzmann Machines and Supervised Feedforward Networks
Ebook
Deep Belief Nets in C++ and CUDA C: Volume 1: Restricted Boltzmann Machines and Supervised Feedforward Networks
byTimothy Masters
Rating: 0 out of 5 stars
0 ratings
Building REST APIs with Flask: Create Python Web Services with MySQL
Ebook
Building REST APIs with Flask: Create Python Web Services with MySQL
byKunal Relan
Rating: 0 out of 5 stars
0 ratings
Learning Azure DocumentDB
Ebook
Learning Azure DocumentDB
byBecker Riccardo
Rating: 0 out of 5 stars
0 ratings
Beginning SQL Server Reporting Services
Ebook
Beginning SQL Server Reporting Services
byKathi Kellenberger
Rating: 0 out of 5 stars
0 ratings
Pro SQL Server Internals
Ebook
Pro SQL Server Internals
byDmitri Korotkevitch
Rating: 0 out of 5 stars
0 ratings
Machine Learning with PySpark: With Natural Language Processing and Recommender Systems
Ebook
Machine Learning with PySpark: With Natural Language Processing and Recommender Systems
byPramod Singh
Rating: 0 out of 5 stars
0 ratings
Foundations of Python Network Programming
Ebook
Foundations of Python Network Programming
byBrandon Rhodes
Rating: 4 out of 5 stars
4/5
Python for Marketing Research and Analytics
Ebook
Python for Marketing Research and Analytics
byJason S. Schwarz
Rating: 0 out of 5 stars
0 ratings
Dynamic SQL: Applications, Performance, and Security in Microsoft SQL Server
Ebook
Dynamic SQL: Applications, Performance, and Security in Microsoft SQL Server
byEdward Pollack
Rating: 0 out of 5 stars
0 ratings

Databases For You

Skip carousel

SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL
Ebook
SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL
byWalter Shields
Rating: 4 out of 5 stars
4/5
Behind Every Good Decision: How Anyone Can Use Business Analytics to Turn Data into Profitable Insight
Ebook
Behind Every Good Decision: How Anyone Can Use Business Analytics to Turn Data into Profitable Insight
byPiyanka Jain
Rating: 5 out of 5 stars
5/5
Access 2010 All-in-One For Dummies
Ebook
Access 2010 All-in-One For Dummies
byAlison Barrows
Rating: 4 out of 5 stars
4/5
Summary of Building a Second Brain: by Tiago Forte - A Proven Method to Organize Your Digital Life and Unlock Your Creative Potential - A Comprehensive Summary
Ebook
Summary of Building a Second Brain: by Tiago Forte - A Proven Method to Organize Your Digital Life and Unlock Your Creative Potential - A Comprehensive Summary
byAlexander Cooper
Rating: 2 out of 5 stars
2/5
Access 2019 For Dummies
Ebook
Access 2019 For Dummies
byLaurie A. Ulrich
Rating: 0 out of 5 stars
0 ratings
Learn Git in a Month of Lunches
Ebook
Learn Git in a Month of Lunches
byRick Umali
Rating: 0 out of 5 stars
0 ratings
Grokking Algorithms: An illustrated guide for programmers and other curious people
Ebook
Grokking Algorithms: An illustrated guide for programmers and other curious people
byAditya Bhargava
Rating: 4 out of 5 stars
4/5
Practical Data Analysis
Ebook
Practical Data Analysis
byHector Cuesta
Rating: 4 out of 5 stars
4/5
Learn SQL in 24 Hours
Ebook
Learn SQL in 24 Hours
byAlex Nordeen
Rating: 5 out of 5 stars
5/5
SQL Programming & Database Management For Absolute Beginners SQL Server, Structured Query Language Fundamentals: "Learn - By Doing" Approach And Master SQL
Ebook
SQL Programming & Database Management For Absolute Beginners SQL Server, Structured Query Language Fundamentals: "Learn - By Doing" Approach And Master SQL
byWilliam Sullivan
Rating: 5 out of 5 stars
5/5
Access 2016 For Dummies
Ebook
Access 2016 For Dummies
byLaurie A. Ulrich
Rating: 0 out of 5 stars
0 ratings
COMPUTER SCIENCE FOR ROOKIES
Ebook
COMPUTER SCIENCE FOR ROOKIES
byAngel Bahabwa
Rating: 0 out of 5 stars
0 ratings
Blockchain Basics: A Non-Technical Introduction in 25 Steps
Ebook
Blockchain Basics: A Non-Technical Introduction in 25 Steps
byDaniel Drescher
Rating: 4 out of 5 stars
4/5
Go in Action
Ebook
Go in Action
byErik St. Martin
Rating: 5 out of 5 stars
5/5
LINUX: Beginner's Crash Course. Your Step-By-Step Guide To Learning The Linux Operating System And Command Line Easy & Fast!
Ebook
LINUX: Beginner's Crash Course. Your Step-By-Step Guide To Learning The Linux Operating System And Command Line Easy & Fast!
byJeremy Li
Rating: 3 out of 5 stars
3/5
Deep Learning with Python: A Comprehensive guide to Building and Training Deep Neural Networks using Python and popular Deep Learning Frameworks
Ebook
Deep Learning with Python: A Comprehensive guide to Building and Training Deep Neural Networks using Python and popular Deep Learning Frameworks
byBrian Murray
Rating: 0 out of 5 stars
0 ratings
PostgreSQL Administration Essentials
Ebook
PostgreSQL Administration Essentials
byHans-Jürgen Schönig
Rating: 0 out of 5 stars
0 ratings
Learn SQL Server Administration in a Month of Lunches
Ebook
Learn SQL Server Administration in a Month of Lunches
byDon Jones
Rating: 3 out of 5 stars
3/5
Managing Data Using Excel
Ebook
Managing Data Using Excel
byMark Gardener
Rating: 5 out of 5 stars
5/5
IMS-DB Basic Training For Application Developers
Ebook
IMS-DB Basic Training For Application Developers
byRobert Wingate
Rating: 0 out of 5 stars
0 ratings
Visualizing Graph Data
Ebook
Visualizing Graph Data
byCorey Lanum
Rating: 0 out of 5 stars
0 ratings
Schaum’s Outline of Fundamentals of SQL Programming
Ebook
Schaum’s Outline of Fundamentals of SQL Programming
byRamon Mata-Toledo
Rating: 3 out of 5 stars
3/5
The Data Model Resource Book: Volume 3: Universal Patterns for Data Modeling
Ebook
The Data Model Resource Book: Volume 3: Universal Patterns for Data Modeling
byLen Silverston
Rating: 0 out of 5 stars
0 ratings
Data Analysis with R
Ebook
Data Analysis with R
byFischetti Tony
Rating: 5 out of 5 stars
5/5
Base SAS Interview Questions You'll Most Likely Be Asked
Ebook
Base SAS Interview Questions You'll Most Likely Be Asked
byVibrant Publishers
Rating: 0 out of 5 stars
0 ratings
Starting Database Administration: Oracle DBA
Ebook
Starting Database Administration: Oracle DBA
byanuragbaruah84
Rating: 3 out of 5 stars
3/5
Practical Data Science with R, Second Edition
Ebook
Practical Data Science with R, Second Edition
byJohn Mount
Rating: 4 out of 5 stars
4/5
Access for Beginners: Access Essentials, #1
Ebook
Access for Beginners: Access Essentials, #1
byM.L. Humphrey
Rating: 0 out of 5 stars
0 ratings
Python and SQLite Development
Ebook
Python and SQLite Development
byAgus Kurniawan
Rating: 0 out of 5 stars
0 ratings
Serverless Architectures on AWS, Second Edition
Ebook
Serverless Architectures on AWS, Second Edition
byPeter Sbarski
Rating: 5 out of 5 stars
5/5

Related podcast episodes

Skip carousel

Cloud Education Made Easy with Katie Bullard: Katie Bullard is the president of A Cloud Guru, a cloud education platform. She’s also a board member at Conservice, ChildCareCRM, and Journyx, Inc. Katie previously served as president and chief growth officer at ZoomInfo (formerly DiscoverOrg), VP of ma
UNLIMITED
Cloud Education Made Easy with Katie Bullard: Katie Bullard is the president of A Cloud Guru, a cloud education platform. She’s also a board member at Conservice, ChildCareCRM, and Journyx, Inc. Katie previously served as president and chief growth officer at ZoomInfo (formerly DiscoverOrg), VP of ma
byScreaming in the Cloud
0 ratings
0% found this document useful
Helping Teacher's Bring Python Into The Classroom With Nicholas Tollervey: Helping Teacher's Bring Python Into The Classroom (Interview)
UNLIMITED
Helping Teacher's Bring Python Into The Classroom With Nicholas Tollervey: Helping Teacher's Bring Python Into The Classroom (Interview)
byThe Python Podcast.__init__
0 ratings
0% found this document useful
All Things Azure with Dwayne Monroe: Dwayne Monroe is a senior cloud architect at Cloudreach, an organization that helps enterprises maximize their cloud investments, who’s focused on Azure. Prior to joining Cloudreach, Dwayne worked as a senior Microsoft and cloud architect at High Availabi
UNLIMITED
All Things Azure with Dwayne Monroe: Dwayne Monroe is a senior cloud architect at Cloudreach, an organization that helps enterprises maximize their cloud investments, who’s focused on Azure. Prior to joining Cloudreach, Dwayne worked as a senior Microsoft and cloud architect at High Availabi
byScreaming in the Cloud
0 ratings
0% found this document useful
Hasty Treat - Seven Interesting JavaScript Proposals - Async Do, JSON Modules, Immutable Array Methods, and More!: In this Hasty Treat, Scott and Wes talk about seven new JavaScript proposals — what they do, where they’re at, and how you might use them. Deque - Sponsor Deque’s axe DevTools makes accessibility testing easy and doesn’t require special...
UNLIMITED
Hasty Treat - Seven Interesting JavaScript Proposals - Async Do, JSON Modules, Immutable Array Methods, and More!: In this Hasty Treat, Scott and Wes talk about seven new JavaScript proposals — what they do, where they’re at, and how you might use them. Deque - Sponsor Deque’s axe DevTools makes accessibility testing easy and doesn’t require special...
bySyntax - Tasty Web Development Treats
0 ratings
0% found this document useful
Yugabyte and Database Innovations with Karthik Ranganathan: This week Corey is joined by Karthik Ranganathan, CTO and Co-Founder of Yugabyte, to talk about databases of which YugabyteDB is one of the best. Karthik started at Facebook building distributed databases and now has moved onto building even more! Why? We
UNLIMITED
Yugabyte and Database Innovations with Karthik Ranganathan: This week Corey is joined by Karthik Ranganathan, CTO and Co-Founder of Yugabyte, to talk about databases of which YugabyteDB is one of the best. Karthik started at Facebook building distributed databases and now has moved onto building even more! Why? We
byScreaming in the Cloud
0 ratings
0% found this document useful
Going with GraphQL: featuring Mark Sandstrom & Ben Kraft
UNLIMITED
Going with GraphQL: featuring Mark Sandstrom & Ben Kraft
byGo Time: Golang, Software Engineering
0 ratings
0% found this document useful
LLMs, Retrieval Augmented Generation, Knowledge Graph, Vector Databases with Mike Dillinger: <p>RAG, Retrieval Augemented Generation, is the term you now constantly hear in conjunction with LLM that provides context. But how does it actually work? And what's the relationship with Vector Databases and Knowledge Graphs? This will be a geeky AI e...
UNLIMITED
LLMs, Retrieval Augmented Generation, Knowledge Graph, Vector Databases with Mike Dillinger: <p>RAG, Retrieval Augemented Generation, is the term you now constantly hear in conjunction with LLM that provides context. But how does it actually work? And what's the relationship with Vector Databases and Knowledge Graphs? This will be a geeky AI e...
byCatalog & Cocktails: The Honest, No-BS Data Podcast
0 ratings
0% found this document useful
235: Pair programming with Ben Orenstein & Tuple: In this episode, Kaushik goes solo and interviews Ben Orenstein. Ben is a prolific Ruby developer, an amazing conference speaker, an ardent vim-ster, and now the CEO of Tuple. Kaushik has been a big fan of Ben's work and was super stoked to talk to Ben and pick his brains on a host of topics: starting the company Tuple, pair programming in general, learning different programming languages and technology, giving better conference talks and more! This episode is chock full of wisdom from Ben. Enjoy!
UNLIMITED
235: Pair programming with Ben Orenstein & Tuple: In this episode, Kaushik goes solo and interviews Ben Orenstein. Ben is a prolific Ruby developer, an amazing conference speaker, an ardent vim-ster, and now the CEO of Tuple. Kaushik has been a big fan of Ben's work and was super stoked to talk to Ben and pick his brains on a host of topics: starting the company Tuple, pair programming in general, learning different programming languages and technology, giving better conference talks and more! This episode is chock full of wisdom from Ben. Enjoy!
byFragmented - Android Developer Podcast
0 ratings
0% found this document useful
JavaScript is the CO2 of the web: with Chris Ferdinandi, "The Vanilla JavaScript guy"
UNLIMITED
JavaScript is the CO2 of the web: with Chris Ferdinandi, "The Vanilla JavaScript guy"
byJS Party: JavaScript, CSS, Web Development
0 ratings
0% found this document useful
400: How To Search: Joël shares he has been getting more into long-form reading. Stephanie talks about the challenges she faced in a new project that required integrating with another company's system. Together, they delve into the importance of search techniques for developers, covering various approaches to finding information online.
UNLIMITED
400: How To Search: Joël shares he has been getting more into long-form reading. Stephanie talks about the challenges she faced in a new project that required integrating with another company's system. Together, they delve into the importance of search techniques for developers, covering various approaches to finding information online.
byThe Bike Shed
0 ratings
0% found this document useful
#56 The 5 Biggest Communication Traps with Davina Stanley: Kevin Appleby is joined by Clarity First Founder, Davina Stanley, to discuss the 5 biggest communication traps. Davina goes into these in detail and how we can avoid falling into them. 1. Assume that shorter is always better
UNLIMITED
#56 The 5 Biggest Communication Traps with Davina Stanley: Kevin Appleby is joined by Clarity First Founder, Davina Stanley, to discuss the 5 biggest communication traps. Davina goes into these in detail and how we can avoid falling into them. 1. Assume that shorter is always better
byGrowCFO Show
0 ratings
0% found this document useful
How to use storyboarding to create more engaging content
UNLIMITED
How to use storyboarding to create more engaging content
byPump Up Your Pod
0 ratings
0% found this document useful
College Composition and Communication: Welcome to Mere Rhetoric, the podcast for beginners and insiders about the ideas, people and movements who have shaped rhetorical history. Uh, I guess including recent history, because today we’re going to talk about the February 2017 issue of...
UNLIMITED
College Composition and Communication: Welcome to Mere Rhetoric, the podcast for beginners and insiders about the ideas, people and movements who have shaped rhetorical history. Uh, I guess including recent history, because today we’re going to talk about the February 2017 issue of...
byMere Rhetoric
0 ratings
0% found this document useful
161. The Science of Human Design: This week, we're looking under the hood of Human Design and taking a peak at how Ra blended some different systems to create Human Design. We'll also touch on the more "scientific" aspect of the system - things like neutrinos and how they impact us -...
UNLIMITED
161. The Science of Human Design: This week, we're looking under the hood of Human Design and taking a peak at how Ra blended some different systems to create Human Design. We'll also touch on the more "scientific" aspect of the system - things like neutrinos and how they impact us -...
byThat Projector Life
0 ratings
0% found this document useful
249. How to Take Smart Notes Book Summary: If you’re a fan of using to stay on top of all the, well, things you need to get done – you’ll love for staying on top of all the things you want to learn. I’ll give you an introduction – in my own words – in this How to Take Smart...
UNLIMITED
249. How to Take Smart Notes Book Summary: If you’re a fan of using to stay on top of all the, well, things you need to get done – you’ll love for staying on top of all the things you want to learn. I’ll give you an introduction – in my own words – in this How to Take Smart...
byLove Your Work
100%
100% found this document useful
Dr. Paul Lessard - Categorical/Structured Deep Learning
UNLIMITED
Dr. Paul Lessard - Categorical/Structured Deep Learning
byMachine Learning Street Talk (MLST)
0 ratings
0% found this document useful
#090 - Become a more effective learner with Russell Helmstedter
UNLIMITED
#090 - Become a more effective learner with Russell Helmstedter
byPybites Podcast
0 ratings
0% found this document useful
Ep: 108 Steven Pinker’s ”Rationality”Chapter 3 ”Logic and Critical Thinking” - reflections and analysis.: This podcast is about Steven Pinker's new book "Rationality". I read a small number of brief excerpts from the book itself, alongside commenting, criticising and reviewing the content of the third chapter “Logic and Critical Thinking”. The first half (o...
UNLIMITED
Ep: 108 Steven Pinker’s ”Rationality”Chapter 3 ”Logic and Critical Thinking” - reflections and analysis.: This podcast is about Steven Pinker's new book "Rationality". I read a small number of brief excerpts from the book itself, alongside commenting, criticising and reviewing the content of the third chapter “Logic and Critical Thinking”. The first half (o...
byToKCast
0 ratings
0% found this document useful
Episode 150 – Roaring News: In this news episode, we use a nice little article on how you can help keep open source sustainable as a structure for a broader discussion on this subject. The second subject this time goes another round on the "data engineers are not data scientists"...
UNLIMITED
Episode 150 – Roaring News: In this news episode, we use a nice little article on how you can help keep open source sustainable as a structure for a broader discussion on this subject. The second subject this time goes another round on the "data engineers are not data scientists"...
byRoaring Elephant
0 ratings
0% found this document useful
Ep 188: Write to Discover What You Really Want to Say: [Ep 188] - In this series, you’ve discovered more about yourself through writing—you may have begun to heal emotional wounds. The act of writing has helped you find the courage to continue to write. Through writing,
UNLIMITED
Ep 188: Write to Discover What You Really Want to Say: [Ep 188] - In this series, you’ve discovered more about yourself through writing—you may have begun to heal emotional wounds. The act of writing has helped you find the courage to continue to write. Through writing,
byAnn Kroeker, Writing Coach
0 ratings
0% found this document useful
Writer's Block Series: Why Discussion and Feedback Are A Writer's Secret Weapon (And How Professionals Use it to Their Advantage) - Part Three: How do you speed up your writing process and what causes it to slow down to a crawl? Surprisingly, discussion and feedback play an incredibly powerful role in unblocking Writer's Block. The more you're stuck, the greater the reason to invite...
UNLIMITED
Writer's Block Series: Why Discussion and Feedback Are A Writer's Secret Weapon (And How Professionals Use it to Their Advantage) - Part Three: How do you speed up your writing process and what causes it to slow down to a crawl? Surprisingly, discussion and feedback play an incredibly powerful role in unblocking Writer's Block. The more you're stuck, the greater the reason to invite...
byThe Three Month Vacation Podcast
0 ratings
0% found this document useful
Unlocking AI’s Capabilities with Rod Naquin Transformative Principal 583
UNLIMITED
Unlocking AI’s Capabilities with Rod Naquin Transformative Principal 583
byTransformative Principal
0 ratings
0% found this document useful
Ep 175: How to Use Lists to Transform Your Writing (and your life): Tis the season for lists, even for those who aren't naturally checklist and to-do list types. For the holidays, people will make packing lists, shopping lists, cleaning lists, address lists, and wish lists. - Lists are useful and practical,
UNLIMITED
Ep 175: How to Use Lists to Transform Your Writing (and your life): Tis the season for lists, even for those who aren't naturally checklist and to-do list types. For the holidays, people will make packing lists, shopping lists, cleaning lists, address lists, and wish lists. - Lists are useful and practical,
byAnn Kroeker, Writing Coach
0 ratings
0% found this document useful
File Management For Graphic Designers - RD022: The Amazing Task of File Management! A little to enthusiastic? Oh well, can't blame a guy for trying. The fact is file management is probably one of the most boring tasks we do as graphic designers. Boring, but necessary if we want to...
UNLIMITED
File Management For Graphic Designers - RD022: The Amazing Task of File Management! A little to enthusiastic? Oh well, can't blame a guy for trying. The fact is file management is probably one of the most boring tasks we do as graphic designers. Boring, but necessary if we want to...
byResourceful Designer: Strategies for running a graphic design business
0 ratings
0% found this document useful
Writing Studies Research in Practice: Welcome to Mere Rhetoric, the podcast for beginners and outsider about the ideas, people and movements who have shaped rhetorical history. Today we’re going to talk about the method to the madness, if madness were writing studies research. That’s...
UNLIMITED
Writing Studies Research in Practice: Welcome to Mere Rhetoric, the podcast for beginners and outsider about the ideas, people and movements who have shaped rhetorical history. Today we’re going to talk about the method to the madness, if madness were writing studies research. That’s...
byMere Rhetoric
0 ratings
0% found this document useful
What You're Building When You Choose to Build a Second Mind (Zettelkasten)
UNLIMITED
What You're Building When You Choose to Build a Second Mind (Zettelkasten)
byThe Scott Scheper Podcast
0 ratings
0% found this document useful
285: BSD Strategy: Strategic thinking to keep FreeBSD relevant, reflecting on the soul of a new machine, 10GbE Benchmarks On Nine Linux Distros and FreeBSD, NetBSD integrating LLVM sanitizers in base, FreeNAS 11.2 distrowatch review, and more.
UNLIMITED
285: BSD Strategy: Strategic thinking to keep FreeBSD relevant, reflecting on the soul of a new machine, 10GbE Benchmarks On Nine Linux Distros and FreeBSD, NetBSD integrating LLVM sanitizers in base, FreeNAS 11.2 distrowatch review, and more.
byBSD Now
0 ratings
0% found this document useful
Dr. Kathy Lund Dean - Merging Into Traffic
UNLIMITED
Dr. Kathy Lund Dean - Merging Into Traffic
byPhronesis: Practical Wisdom for Leaders with Scott Allen
0 ratings
0% found this document useful
Kees van Deemter, “Computational Models of Referring: A Study in Cognitive Science” (MIT Press, 2016): Sometimes we have to depend on philosophy to explain to us why something apparently simple is in fact extremely complicated. The way we use referring expressions – things that pick out the entities we want to talk about, such as “Mary”,
UNLIMITED
Kees van Deemter, “Computational Models of Referring: A Study in Cognitive Science” (MIT Press, 2016): Sometimes we have to depend on philosophy to explain to us why something apparently simple is in fact extremely complicated. The way we use referring expressions – things that pick out the entities we want to talk about, such as “Mary”,
byNew Books in Psychology
0 ratings
0% found this document useful
Kees van Deemter, “Computational Models of Referring: A Study in Cognitive Science” (MIT Press, 2016): Sometimes we have to depend on philosophy to explain to us why something apparently simple is in fact extremely complicated. The way we use referring expressions – things that pick out the entities we want to talk about, such as “Mary”,
UNLIMITED
Kees van Deemter, “Computational Models of Referring: A Study in Cognitive Science” (MIT Press, 2016): Sometimes we have to depend on philosophy to explain to us why something apparently simple is in fact extremely complicated. The way we use referring expressions – things that pick out the entities we want to talk about, such as “Mary”,
byNew Books in Language
0 ratings
0% found this document useful

Skip carousel

Rise Of The Robots
Linux Format
UNLIMITED
Rise Of The Robots
Jan 12, 2021
7 min read
HotPicks
Linux Format
UNLIMITED
HotPicks
Dec 15, 2020
13 min read
VisionFive V1 RISC-V SBC on sale
Linux Format
UNLIMITED
VisionFive V1 RISC-V SBC on sale
May 3, 2022
1 min read
What Is The Future Of Game Streaming Now That Stadia Is Dead?
APC
UNLIMITED
What Is The Future Of Game Streaming Now That Stadia Is Dead?
Oct 31, 2022
Once hyped as being ‘the future of gaming’, the Google Stadia game streaming service was officially, just three years after launch and before even making it to Australian shores. When game streaming first launched we did have some apprehension about
2 min read
Build a Better nginx Reverse Proxy
Maximum PC
UNLIMITED
Build a Better nginx Reverse Proxy
Feb 4, 2020
4 min read
Jump to LineageOS
Linux Format
UNLIMITED
Jump to LineageOS
Feb 7, 2023
5 min read
Add Military-level Security To Any Project
Linux Format
UNLIMITED
Add Military-level Security To Any Project
Aug 27, 2019
7 min read
Build Your Own URL Shortening Service
Linux Format
UNLIMITED
Build Your Own URL Shortening Service
May 4, 2021
7 min read
Tackling Terminal Tabular Table Tools!
Linux Format
UNLIMITED
Tackling Terminal Tabular Table Tools!
Jan 10, 2023
9 min read
Research Logs:
Family Tree UK
UNLIMITED
Research Logs:
Mar 8, 2024
You’ve probably heard the story of Theseus and the Minotaur: how the young hero wound his way through a fiendish labyrinth, to slay the fearsome beast hidden in its confines. But do you recall how Theseus escaped from the maze, when others had been t
9 min read
Writing Your First Article
Writer's Digest
UNLIMITED
Writing Your First Article
Sep 18, 2019
After all your hard work and writing the perfect query, you now have an article writing assignment. The editor has detailed the word count, deadline, the rights, and the payment, and now it’s time to write the article. Keep in mind everything you kno
6 min read
Reinier de Graaf on “the Creative Tension Between Thinking and Doing”
Architecture Australia
UNLIMITED
Reinier de Graaf on “the Creative Tension Between Thinking and Doing”
Jul 2, 2018
Reinier de Graaf is a partner at the Office for Metropolitan Architecture (OMA) and co-founder of the practice’s research arm, AMO. As an architect, an architectural and urban theorist, a writer and an educator, he regularly works well beyond a narro
7 min read
QUICK GUIDE 6: How To Do Research
You South Africa
UNLIMITED
QUICK GUIDE 6: How To Do Research
Jul 21, 2023
3 min read
Business Buddy
Writing Magazine
UNLIMITED
Business Buddy
Nov 2, 2023
Some years ago, I had an idea for a non-fiction children’s book. It sat undeveloped (and unloved) in my archives until it was resurrected recently as something that I am co-writing with a writer who was a teacher. Having co-authored several books and
1 min read
For Google, Everything Is a Popularity Contest
The Atlantic
UNLIMITED
For Google, Everything Is a Popularity Contest
Jun 27, 2017
8 min read
How To Read A Book
Writing Magazine
UNLIMITED
How To Read A Book
Apr 2, 2020
3 min read
How To Craft An Anthology Proposal
The Writer
UNLIMITED
How To Craft An Anthology Proposal
Jun 18, 2019
5 min read
How to Sound Smart and Memorable, Anytime
Entrepreneur
UNLIMITED
How to Sound Smart and Memorable, Anytime
Sep 19, 2023
4 min read
Note-taking Applications For Family History
Family Tree UK
UNLIMITED
Note-taking Applications For Family History
Mar 10, 2023
7 min read
How (And When) To Think Like A Philosopher
NPR
UNLIMITED
How (And When) To Think Like A Philosopher
Apr 10, 2017
3 min read
What Designers Can Learn From a Pioneering Anthropologist
The Atlantic
UNLIMITED
What Designers Can Learn From a Pioneering Anthropologist
Jul 1, 2011
3 min read
From SCRATCH to ANTHOLOGY
Writer's Digest
UNLIMITED
From SCRATCH to ANTHOLOGY
Jul 31, 2019
8 min read
Schopenhauer’s Advice on How to Achieve Great Things
The Atlantic
UNLIMITED
Schopenhauer’s Advice on How to Achieve Great Things
Feb 8, 2024
6 min read
Why Robot Brains Need Symbols
Nautilus
UNLIMITED
Why Robot Brains Need Symbols
Dec 6, 2018
Nowadays, the words “artificial intelligence” seem to be on practically everyone’s lips, from Elon Musk to Henry Kissinger. At least a dozen countries have mounted major AI initiatives, and companies like Google and Facebook are locked in a massive b
16 min read
On Reading And Lucky Jobs
Architecture NZ
UNLIMITED
On Reading And Lucky Jobs
May 4, 2024
5 min read
WRITE to KNOW
Family Tree
UNLIMITED
WRITE to KNOW
Apr 25, 2023
7 min read
Write That Book For NaNoWriMo
PC Pro Magazine
UNLIMITED
Write That Book For NaNoWriMo
Oct 7, 2021
7 min read
SYNC OR SWIM Trello
Screen Education
UNLIMITED
SYNC OR SWIM Trello
Sep 15, 2019
8 min read
Getting Real About Time
Poets & Writers
UNLIMITED
Getting Real About Time
Aug 14, 2024
9 min read
Writing Motivation
The Writer
UNLIMITED
Writing Motivation
Jun 24, 2023
Where do writing motivation problems come from, and how can we overcome them? In recent years, there has been a movement in psychology to understand what happens in our brains when things are going well. Research in neurology, cognitive psychology, p
5 min read

Related categories

Skip carousel

Reviews for Database Design and Relational Theory

Rating: 4 out of 5 stars

4/5

2 ratings0 reviews

Book preview

Database Design and Relational Theory - C. J. Date

Part ISetting the Scene

Setting the Scene

This part of the book consists of two introductory chapters, the titles of which (Preliminaries and Prerequisites, respectively) are more or less self-explanatory.

C. J. DateDatabase Design and Relational Theoryhttps://doi.org/10.1007/978-1-4842-5540-7_1

1. Preliminaries

C. J. Date¹

(1)

Healdsburg, California, USA

(On being asked what jazz is:)

Man, if you gotta ask, you’ll never know.

—Louis Armstrong (attrib.)

This book has as its subtitle Normal Forms and All That Jazz. Clearly some explanation is needed! First of all, of course, I’m talking about design theory—database design theory, that is—and everybody knows that normal forms are a major component of that theory; hence the first part of the subtitle. But there’s more to that theory than just normal forms, and that fact accounts for that subtitle’s second part. Third, it’s unfortunately the case that—from the practitioner’s point of view, at any rate—design theory seems to be riddled with terms and concepts that are hard to understand and don’t seem to have much to do with design as actually done in practice. That’s why I framed the latter part of my subtitle in colloquial (not to say slangy) terms; I wanted to convey the idea that, although we’d necessarily be dealing with difficult material on occasion, the treatment of that material would be as undaunting and unintimidating as I could make it. But whether I’ve succeeded in that aim is for you to judge, of course.

I’d also like to say a little more on the question of whether design theory has anything to do with design as carried out in practice. Let me be clear: Nobody could, or should, claim that database design is easy. But a sound knowledge of the theory can only help. In fact, if you want to do design properly—if you want to build databases that are as robust, flexible, and accurate as they’re supposed to be—then you simply have to come to grips with the theory. There’s just no alternative: at least, not if you want to claim to be a design professional. Design theory is the scientific foundation for database design, just as the relational model is the scientific foundation for database technology in general. And just as anyone professionally involved in database technology in general needs to be familiar with the relational model, so anyone involved in database design in particular needs to be familiar with design theory. Proper design is so important! After all, the database lies at the heart of so much of what we do in the computing world; so if it’s badly designed, the negative impacts can be extraordinarily widespread.

Some Quotes from the Literature

Since we’re going to be talking quite a lot about normal forms, I thought it might be—well, not exactly enlightening, but entertaining, possibly (?)—to begin with a few quotes from the literature. The starting point for the whole concept of normal forms is, of course, first normal form (1NF), and so an obvious question is: Do you know what 1NF is? As the following quotes demonstrate (sources omitted to protect the guilty), a lot of people don’t:

To achieve first normal form, each field in a table must convey unique information.

An entity is said to be in the first normal form (1NF) when all attributes are single valued.

A relation is in 1NF if and only if all underlying domains contain atomic values only.

If there are no repeating groups of attributes, then [the table] is in 1NF.

Now, it might be argued that some if not all of these quotes are at least vaguely correct—but they’re all hopelessly sloppy, even when they’re generally on the right lines. Note: In case you’re wondering, I’ll be giving a precise and accurate definition of 1NF in Chapter 4.

Let’s take a closer look at what’s going on here. Here again is the first of the foregoing quotes, now given in full:

To achieve first normal form, each field in a table must convey unique information. For example, if you had a Customer table with two columns for the telephone number, your design would violate first normal form. First normal form is fairly easy to achieve, since few folks would see a need for duplicate information in a table.

OK, so apparently we’re talking about a design that looks something like this:

../images/489629_2_En_1_Chapter/489629_2_En_1_Figa_HTML.jpg

Now, I can’t say whether this is a good design or not, but it certainly doesn’t violate 1NF. (I can’t say whether it’s a good design because I don’t know exactly what two columns for the telephone number means—the phrase duplicate information in a table suggests we’re recording the same phone number twice, but such an interpretation is absurd on its face. But even if that interpretation is correct, it still wouldn’t constitute a violation of 1NF as such.)

Here’s another quote:

First Normal Form ... means the table should have no repeating groups of fields ... A repeating group is when you repeat the same basic attribute (field) over and over again. A good example of this is when you wish to store the items you buy at a grocery store ... [and the writer goes on to give an example, presumably meant to illustrate the concept of a repeating group, of a table called Item Table, with columns called Customer, Item1, Item2, Item3, and Item4]:

../images/489629_2_En_1_Chapter/489629_2_En_1_Figb_HTML.jpg

Well, this design is almost certainly bad—what happens if the customer doesn’t purchase exactly four items?—but the reason it’s bad isn’t that it violates 1NF; like the previous example, in fact, it’s a 1NF design. So, while it might perhaps be claimed—indeed, it often is claimed—that 1NF does mean, loosely, no repeating groups, a repeating group is not when you repeat the same basic attribute over and over again.¹

How about this one (a cry for help found on the Internet)? I’m quoting it absolutely verbatim, except that I’ve added some boldface:

I have been trying to find the correct way of normalizing tables in Access. From what I understand, it goes from the 1st normal form to 2nd, then 3rd. Usually, that’s as far as it goes, but sometimes to the 5th and 6th. Then, there’s also the Cobb 3rd. This all makes sense to me. I am supposed to teach a class in this starting next week, and I just got the textbook. It says something entirely different. It says 2nd normal form is only for tables with a multiple-field primary key, 3rd normal form is only for tables with a single-field key. 4th normal form can go from 1st to 4th, where there are no independent one-to-many relationships between primary key and non-key fields. Can someone clear this up for me please?

And one more (this time with a helpful response):

It’s not clear to me what normalizedmeans. Can you be specific about what normalization rules you are referring to? In what way is my schema not normalized?

Normalization: The process of replacing duplicate things with a reference to the original thing.

For example, given john is-a person and john obeys army, one observes that the john in the second sentence is a duplicate of john in the first sentence. Using the means provided by your system, the second sentence should be stored as ->john obeys army.

A Note on Terminology

As I’m sure you noticed, the quotes in the previous section were expressed for the most part in the familiar user friendly terminology of tables, rows, and columns (or fields). In this book, by contrast, I’ll favor the more formal terms relation , tuple (usually pronounced to rhyme with couple), and attribute. I apologize if this decision on my part makes the text a little harder to follow, but I do have my reasons. As I said in SQL and Relational Theory: ²

I’m generally sympathetic to the idea of using more user friendly terms, if they can help make the ideas more palatable. In the case at hand, however, it seems to me that, regrettably, they don’t make the ideas more palatable; instead, they distort them, and in fact do the cause of genuine understanding a grave disservice. The truth is, a relation is not a table, a tuple is not a row, and an attribute is not a column. And while it might be acceptable to pretend otherwise in informal contexts—indeed, I often do so myself—I would argue that it’s acceptable only if all parties involved understand that those more user friendly terms are just an approximation to the truth and fail overall to capture the essence of what’s really going on. To put it another way: If you do understand the true state of affairs, then judicious use of the user friendly terms can be a good idea; but in order to learn and appreciate that true state of affairs in the first place, you really do need to come to grips with the formal terms.

To the foregoing, let me add that (as I said in the preface) I do assume you know exactly what relations, attributes, and tuples are—though in fact formal definitions of these constructs can be found in Chapter 5.

There’s another terminological matter I need to get out of the way, too. The relational model is, of course, a data model. Unfortunately, however, this latter term has two quite distinct meanings in the database world.³ The first and more fundamental one is this:

Definition (data model,first sense): An abstract, self-contained, logical definition of the data structures, data operators, and so forth, that together make up the abstract machine with which users interact.

This is the meaning we have in mind when we talk about the relational model in particular: The data structures in the relational model are relations, of course, and the data operators are the relational operators projection, join, and all the rest. (As for that and so forth in the definition, it covers such matters as keys, foreign keys, and various related concepts.)

The second meaning of the term data model is as follows:

Definition (data model,second sense): A model of the data (especially the persistent data) of some particular enterprise.

In other words, a data model in the second sense is just a (logical, and possibly somewhat abstract) database design. For example, we might speak of the data model for some bank, or some hospital, or some government department.

Having explained these two different meanings, I’d like to draw your attention to an analogy that I think nicely illuminates the relationship between them:

A data model in the first sense is like a programming language, whose constructs can be used to solve many specific problems but in and of themselves have no direct connection with any such specific problem.

A data model in the second sense is like a specific program written in that language—it uses the facilities provided by the model, in the first sense of that term, to solve some specific problem.

It follows from all of the above that if we’re talking about data models in the second sense, then we might reasonably speak of relational models in the plural, or a relational model, with an indefinite article. But if we’re talking about data models in the first sense, then there’s only one relational model, and it’s the relational model, with the definite article.

Now, as you are probably aware, most writings on database design, especially if their focus is on pragma rather than the underlying theory, use the term model, or the term data model, exclusively in the second sense. But—please note very carefully!—I don’t follow this practice in the present book; in fact, I don’t use the term model at all, except occasionally to refer to the relational model as such.

The Running Example

Now let me introduce the example I’ll be using as a basis for most of the discussions in the rest of the book: the familiar—not to say hackneyed—suppliers-and-parts database. (I apologize for dragging out this old warhorse yet one more time, but I do believe that using essentially the same example in a variety of different books and publications can help, not hinder, the learning process.) Sample values are shown in Figure 1-1 on the next page.⁴ To elaborate:

Suppliers: Relvar S denotes suppliers.⁵ Each supplier has one supplier number (SNO), unique to that supplier; one name (SNAME), not necessarily unique (though the SNAME values in Figure 1-1 do happen to be unique); one status value (STATUS), representing some kind of ranking or preference level among suppliers; and one location (CITY).

Parts: Relvar P denotes parts (more accurately, kinds of parts). Each kind of part has one part number (PNO), which is unique; one name (PNAME), not necessarily unique; one color (COLOR); one weight (WEIGHT); and one location where parts of that kind are stored (CITY).

Shipments: Relvar SP denotes shipments—it shows which parts are supplied, or shipped, by which suppliers. Each shipment has one supplier number (SNO), one part number (PNO), and one quantity (QTY). Also, I assume for the sake of the example that there’s at most one shipment at any given time for a given supplier and a given part, and so each shipment has a supplier-number / part-number combination that’s unique.

../images/489629_2_En_1_Chapter/489629_2_En_1_Fig1_HTML.jpg

Figure 1-1

The suppliers-and-parts database—sample values

Keys

Before going any further , I need to review the familiar concept of keys, in the relational sense of that term. First of all, as I’m sure you know, every relvar has at least one candidate key. A candidate key is basically just a unique identifier; in other words, it’s a combination of attributes—often but not always a combination consisting of just a single attribute—such that every tuple in the relvar has a unique value for the combination in question. For example, with respect to the database of Figure 1-1:

Every supplier has a unique supplier number and every part has a unique part number, so {SNO} is a candidate key for S and {PNO} is a candidate key for P.

As for shipments, given the assumption that there’s at most one shipment at any given time for a given supplier and a given part, {SNO,PNO} is a candidate key for SP.

Note the braces, by the way; to repeat, candidate keys are always combinations, or sets, of attributes (even when the set in question contains just one attribute), and the conventional representation of a set on paper is as a commalist of elements enclosed in braces.

This is the first time I’ve mentioned the term commalist, which I’ll be using from time to time in the pages ahead. It can be defined as follows. Let xyz be some syntactic construct (for example, attribute name); then the term xyz commalist denotes a sequence of zero or more xyz’s in which each pair of adjacent xyz’s is separated by a comma (blank spaces appearing immediately before or after any comma are ignored). For example, if A, B, and C are attribute names, then the following are all attribute name commalists:

A , B , C

C , A , B

A , C

So too is the empty sequence of attribute names.

Moreover, when some commalist is enclosed in braces and thereby denotes a set, then (a) blank spaces appearing immediately after the opening brace or immediately before the closing brace are ignored, (b) the order in which the elements appear within the commalist is immaterial (because sets have no ordering to their elements), and (c) if an element appears more than once, it’s treated as if it appeared just once (because sets don’t contain duplicate elements).

Next, as I’m sure you also know, a primary key is a candidate key that’s been singled out in some way for some kind of special treatment. Now, if the relvar in question has just one candidate key, then it doesn’t make any real difference if we call that key primary. But if the relvar has two or more candidate keys, then it’s usual to choose one of them to be primary, meaning it’s somehow more equal than the others. Suppose, for example, that suppliers always have both a unique supplier number and a unique supplier name, so that {SNO} and {SNAME} are both candidate keys. Then we might choose {SNO}, say, to be the primary key.

Observe now that I said it’s usual to choose a primary key. Indeed it is usual—but it’s not 100% necessary. If there’s just one candidate key, then there’s no choice and no problem; but if there are two or more, then having to choose one and make it primary smacks a little bit of arbitrariness, at least to me. (Certainly there are situations where there don’t seem to be any really good reasons for making such a choice. There might even be good reasons for not doing so. Appendix C elaborates on such matters.) For reasons of familiarity, I’ll usually follow the primary key discipline myself in this book—and in pictures like Figure 1-1 I’ll indicate primary key attributes by double underlining—but I want to stress the fact that it’s really candidate keys, not primary keys, that are significant from a relational point of view, and indeed from a design theory point of view as well. Partly for such reasons, from this point forward I’ll use the term key, unqualified, to mean any candidate key, regardless of whether the candidate key in question has additionally been designated as primary. (In case you were wondering, the special treatment enjoyed by primary keys over other candidate keys is mainly syntactic in nature, anyway; it isn’t fundamental, and it isn’t very important.)

More terminology: First, a key involving two or more attributes is said to be composite (and a noncomposite key is sometimes said to be simple) . Second, if a given relvar has two or more keys and one is chosen as primary, then the others are sometimes said to be alternate keys (see Appendix C). Third, a foreign key is a combination, or set, of attributes FK in some relvar R2 such that each FK value is required to be equal to some value of some key K in some relvar R1 (R1and R2 not necessarily distinct).⁶ With reference to Figure 1-1, for example, {SNO} and {PNO} are both foreign keys in relvar SP, corresponding to keys {SNO} and {PNO} in relvars S and P, respectively.

The Place of Design Theory

As I said in the preface, by the term design I mean logical design, not physical design. Logical design is concerned with what the database looks like to the user (which means, loosely, what relvars exist and what constraints apply to those relvars); physical design, by contrast, is concerned with how a given logical design maps to physical storage.⁷ And the term design theory refers specifically to logical design, not physical design—the point being that physical design is necessarily dependent on aspects (performance aspects in particular) of the target DBMS, whereas logical design is, or should be, DBMS independent. Throughout this book, then, the unqualified term design should be understood to mean logical design specifically, unless the context demands otherwise.

Now, design theory as such isn’t part of the relational model; rather, it’s a separate theory that builds on top of that model. (It’s appropriate to think of it as part of relational theory in general, but it’s not, to repeat, part of the relational model per se.) Thus, design concepts such as further normalization are themselves based on more fundamental notions—e.g., the projection and join operators of the relational algebra—that are part of the relational model. (All of that being said, however, it could certainly be argued that design theory is a logical consequence of the relational model, in a sense. In other words, I think it would be inconsistent to agree with the relational model in general but not to agree with the design theory that’s based on it.)

The overall objective of logical design is to achieve a design that’s (a) hardware independent, for obvious reasons; (b) operating system and DBMS independent, again for obvious reasons; and finally, and perhaps a little controversially, (c) application independent (in other words, we’re concerned primarily with what the data is, rather than with how it’s going to be used). Application independence in this sense is desirable for the very good reason that it’s normally—perhaps always—the case that not all uses to which the data will be put are known at design time; thus, we want a design that’ll be robust, in the sense that it won’t be invalidated by the advent of application requirements that weren’t foreseen at the time of the original design. Observe that one important consequence of this state of affairs is that we aren’t (or at least shouldn’t be) interested in making design compromises for physical performance reasons. Design theory in general, and individual database designs in particular, should never be driven by mere performance considerations.

Back to design theory as such. As we’ll see, that theory includes a number of formal theorems, theorems that provide practical guidelines for designers to follow. So if you’re a designer, you need to be familiar with those theorems. Let me quickly add that I don’t mean you need to know how to prove the theorems in question (though in fact the proofs are often quite simple); what I mean is, you need to know what the theorems say—i.e., you need to know the results—and you need to be prepared to apply those results. That’s the nice thing about theorems: Once somebody’s proved them, then their results become available for anybody to use whenever they need to.

Now, it’s sometimes claimed, not entirely unreasonably, that all design theory really does is bolster up your intuition. What do I mean by this remark? Well, consider the suppliers-and-parts database. The obvious design for that database is the one illustrated in Figure 1-1; I mean, it’s obvious that three relvars are necessary, that attribute STATUS belongs in relvar S, that attribute COLOR belongs in relvar P, that attribute QTY belongs in relvar SP, and so on. But why exactly are these things obvious? Well, suppose we try a different design; for example, suppose we move the STATUS attribute out of relvar S and into relvar SP (intuitively the wrong place for it, of course, since status is a property of suppliers, not shipments). Figure 1-2 on the next page shows a sample value for this revised shipments relvar, which I’ll call STP to avoid confusion:⁸

../images/489629_2_En_1_Chapter/489629_2_En_1_Fig2_HTML.jpg

Figure 1-2

Relvar STP—sample value

A glance at the figure is sufficient to show what’s wrong with this design: It’s redundant, in the sense that every tuple for supplier S1 tells us S1 has status 20, every tuple for supplier S2 tells us S2 has status 30, and so on.⁹ And design theory tells us that not designing the database in the obvious way will lead to such redundancy, and tells us also (albeit implicitly, perhaps) what the consequences of such redundancy will be. In other words, design theory is largely—though not exclusively—about reducing redundancy, as we’ll see. (As an aside, I remark that partly for such reasons, the theory has been described, perhaps a little unkindly, as a good source of bad examples.)

Now, if design theory really does just bolster up your intuition, then it might be (and indeed has been) criticized on the grounds that it’s really all just common sense anyway. By way of example, consider relvar STP again. As I’ve said, that relvar is obviously badly designed; the redundancies are obvious, the consequences are obvious too, and any competent human designer would naturally avoid such a design, even if that designer had no explicit knowledge of design theory at all. But what does naturally mean here? What principles are being applied by that human designer in opting for a more natural (and better) design?

The answer is: They’re exactly the principles that design theory talks about (the principles of normalization, for example). In other words, competent designers already have those principles in their brain, as it were, even if they’ve never studied them formally and can’t put a name to them or articulate them precisely. So yes, the principles are common sense—but they’re formalized common sense. (Common sense might be common, but it’s not always easy to say exactly what it is!) What design theory does is state in a precise way what certain aspects of common sense consist of. In my opinion, that’s the real achievement—or one of the real achievements, anyway—of the theory: It formalizes certain commonsense principles, thereby opening the door to the possibility of mechanizing those principles (that is, incorporating them into computerized design tools). Critics of the theory often miss this point; they claim, quite rightly, that the ideas are mostly just common sense, but they don’t seem to realize it’s a significant achievement to state what common sense means in a precise and formal way.

As a kind of postscript to the foregoing, I note that common sense might not always be that common anyway. The following lightly edited extract from a paper by Robert R. Brown¹⁰ illustrates the point. Brown begins by giving a simplified real example—his words—involving an employee file (with fields for employee number, employee name, phone number, department number, and manager name) and a department file (with fields for department number, department name, manager name, and manager’s phone number), where everything has the intuitively obvious meaning. Then he continues:

The actual database on which this example is based had many more files and fields and much more redundancy. When the designer was asked his reasons for such a design, he cited performance and the difficulty of doing joins. Even though the redundancy should be clear to you in my example, it was not that evident in the design documentation. In large databases with many more files and fields, it is impossible to find the duplications without doing extensive information analysis and without having extended discussions with the experts in the user organizations.

Incidentally, there’s another quote I like a lot—in fact, I used it as an epigraph in SQL and Relational Theory—that supports my contention that practitioners really do need to know the theoretical foundations of their field. It’s from Leonardo da Vinci (and is thus some 500 years old!), and it goes like this (I’ve added the boldface):

Those who are enamored of practice without theory are like a pilot who goes into a ship without rudder or compass and never has any certainty where he is going. Practice should always be based upon a sound knowledge of theory.

Aims of this Book

If you’re like me, you’ll have encountered lots of design theory terms in the literature and live presentations and the like—terms such as projection-join normal form, the chase, join dependency , FD preservation, and many others—and I’m sure you’ve wondered from time to time exactly what they all mean. Thus, it’s one of my aims in this book to explain such terms: to define them carefully and accurately, to explain their relevance and applicability, and generally to remove any air of mystery that might seem to surround them. And if I’m successful in that aim, I’ll have gone a good way to explaining what design theory is and why it’s important (indeed, a possible alternative title for the book could well be Database Design Theory: What It Is and Why You Should Care). Overall, it’s my goal to provide a painless introduction to design theory for database professionals. More specifically, what I want to do is the following:

Review, albeit from a possibly unfamiliar perspective, aspects of design you should already be familiar with

Explore in depth aspects you’re probably not already familiar with

Provide clear and accurate explanations and definitions (with plenty of examples) of all pertinent concepts

Not spend too much time on material that’s widely understood already, such as second and third normal form (2NF and 3NF)¹¹

All of that being said, I should say too that database design is not my favorite subject. The reason it’s not is that much of that subject is still somewhat ... well, subjective. As I said earlier, design theory is the scientific foundation for database design. Sadly, however, there are numerous design issues that the theory simply doesn’t address at all (at least, not yet). Thus, while the formal principles I’ll be describing in this book do represent the scientific part of design, there are other parts that, as I’ve put it elsewhere, are still much more in the nature of an artistic endeavor. Indeed, one message of the book is precisely that we need more science in this field (see Chapter 17).

To put a more positive spin on matters, I’d like to draw your attention to the following. Design theory is, at least in part, about capturing the meaning of data, and as Codd himself once said in connection with that notion:¹²

[The] task of capturing the meaning of data (in a reasonably formal way) is never ending ... The goal is nevertheless an extremely important one because even small successes can bring understanding and order into the field of database design.

In fact, I’ll go further: If your design violates any of the known science, then, as I’ve written elsewhere (in a slightly different context), the one thing you can be sure of is that things will go wrong. And though it might be hard to say exactly what will go wrong, and it might be hard to say whether things will go wrong in a major or minor way, you know—it’s guaranteed—that they will go wrong. Theory is important.

Concluding Remarks

This book grew in the writing; it turns out that, despite the slightly negative tone of some of the remarks in the previous section, there’s really quite a lot of good material to cover. What’s more, the material builds. Thus, while the first few chapters might seem to be going rather slowly, I think you’ll find the pace picks up later on. Part of the point is the number of terms and concepts that need to be introduced; the ideas aren’t really difficult, but they can seem a little overwhelming, at least until you’re comfortable with the terminology. For that reason, at least in certain key parts of the book, I’ll be presenting the material twice—first from an informal perspective, and then again from a more formal one. (As Bertrand Russell once memorably said: Writing can be either readable or precise, but not at the same time. I’m trying to have my cake and eat it too.)

And talking of Bertrand Russell, it seems appropriate to close this chapter with another wonderful quote from his writings:¹³

I have been accused of a habit of changing my opinions ... I am not myself in any degree ashamed of [that habit]. What physicist who was already active in 1900 would dream of boasting that his opinions had not changed during the last half century? ... The kind of philosophy that I value and have endeavoured to pursue is scientific, in the sense that there is some definite knowledge to be obtained and that new discoveries can make the admission of former error inevitable to any candid mind. For what I have said, whether early or late, I do not claim the kind of truth which theologians claim for their creeds. I claim only, at best, that the opinion expressed was a sensible one to hold at the time ... I should be much surprised if subsequent research did not show that it needed to be modified. [Such opinions were not] intended as pontifical pronouncements, but only as the best I could do at the time towards the promotion of clear and accurate thinking. Clarity, above all, has been my aim.

I’ve quoted this extract elsewhere—in the preface to my book An Introduction to Database Systems (8th edition, Addison-Wesley, 2004) in particular. The reason I mention this latter book is that it includes among other things a tutorial treatment of some of the material covered in more depth in the present book. But the world has moved on; my own understanding of the theory is, I hope, quite a lot better than it was when I wrote that earlier book, and there are aspects of the treatment in that book that I would frankly now like to revise. One problem with that earlier treatment was that I attempted to make the material more palatable by adopting the fiction that every relvar has just one key, which could then harmlessly be regarded as the primary key. But a consequence of that simplifying assumption was that several of the definitions I gave (e.g., of 2NF and 3NF) were less than fully accurate. This state of affairs has led to a certain amount of confusion in the community—partly my fault, I freely admit, but partly also the fault of people who took the definitions out of context.

Exercises

The purpose of these exercises is to give some idea of the scope of the chapters to come, and also perhaps to test the extent of your existing knowledge. They can’t be answered from material in the present chapter alone.

1.1

Is it true that the relational model doesn’t require relvars to be in any particular normal form?

1.2

Should data redundancy always be eliminated? Can it be?

1.3

What’s the difference between 3NF and BCNF?

1.4

Is it true that every all key relvar is in BCNF?

1.5

Is it true that every binary relvar is in 4NF?

1.6

Is it true that every all key relvar is in 5NF?

1.7

Is it true that every binary relvar is in 5NF?

1.8

Is it true that if a relvar is in BCNF but not 5NF, then it must be all key?

1.9

Is it true that if a relvar has just one key and just one other attribute, then it’s in 5NF?

1.10

Can you give a precise definition of 5NF?

1.11

Is it true that if a relvar is in 5NF, then it’s redundancy free?

1.12

What precisely is denormalization?

1.13

What’s Heath’s Theorem, and why is it important?

1.14

What’s The Principle of Orthogonal Design?

1.15

What makes some JDs irreducible and others not?

1.16

What’s dependency preservation, and why is it important?

1.17

What’s the chase?

1.18

How many normal forms can you name?

Answers

Note: All mistakes in this and other Answers sections in this book are deliberate .

1.1

Yes, it is. Good design benefits the user, and to some extent the DBMS as well, but the relational model as such doesn’t care how the database happens to be designed, just so long as the objects it has to deal with are indeed relations and not something else (which, sadly, they often are, in SQL¹⁴).

1.2

See Chapter 17.

1.3

See Chapters 4 and 5.

1.4

Yes (see Chapters 4 and 5).

1.5

No. (Actually, it’s not even true that every binary relvar is in 2NF. See Exercise 4.6.)

1.6

No (see Chapters 9 and 10).

1.7

No a fortiori, given the answer to Exercise 1.5.

1.8

No (see Chapter 13).

1.9

No (see Chapter 13).

1.10

See Chapter 10.

1.11

No (see Chapters 9 and 17).

1.12

See Chapter 8.

1.13

See Chapter 5.

1.14

See Chapter 16.

1.15

See Chapter 11.

1.16

See Chapter 7.

1.17

See Chapter 11.

1.18

See Chapter 15.

Footnotes

At the same time it’s not as easy as you might think to say exactly what it is! See further discussion in Chapter 4.

I remind you from the preface that throughout this book I use SQL and Relational Theory as an abbreviated form of reference to my book SQL and Relational Theory: How to Write Accurate SQL Code (3rd edition, O’Reilly, 2015).

This observation is undeniably correct. However, one reviewer wanted me to add that the two meanings can be thought of as essentially the same concept at different levels of abstraction. I hope that helps!

For reasons that might or might not become clear later, the values shown in Fig. 1.1 differ in two small respects from those in other books of mine: First, the status for supplier S2 is shown as 30 instead of 10; second, the city for part P3 is shown as Paris instead of Oslo.

If you don’t know what a relvar is, for now you can just take it to be a table in the usual database sense. See Chapter 2 for further explanation.

This definition is deliberately a little simplified (though it’s good enough for present purposes). A better one can be found in Chapter 3, also in SQL and Relational Theory.

Be aware, however, that other writers (a) use those terms logical design and physical design to mean something else and (b) use other terms to mean what I mean by those terms. Caveat lector.

For obvious reasons, throughout this book I use T, not S, as an abbreviation for STATUS.

You might notice another problem, too: The design can’t properly represent suppliers like supplier S5 who currently supply no parts at all. This problem and others like it are discussed in Chapter 3.

Robert R. Brown: Database Systems in Engineering: Key Problems, and Potential Solutions, in the proceedings of a database symposium held in Sydney, Australia (November 15th-17th, 1984).

However, I will at least give precise definitions of those familiar concepts for reasons of completeness. Since I’m sure they really are familiar, however, I’ll take the liberty of appealing to them from time to time even before we get to those definitions.

The quote—which I’ve edited somewhat here (the italics are mine)—is taken from Codd’s paper Extending the Database Relational Model to Capture More Meaning, ACM Transactions on Database Systems 4, No. 4 (1979). E. F. (Ted) Codd was, of course, the inventor of the relational model. What’s more, he was also the person who first defined the concept of normalization in general, as well as the first three normal forms (1NF, 2NF, 3NF) in particular.

The quote is from the preface to The Bertrand Russell Dictionary of Mind, Matter and Morals (ed., Lester E. Denonn; Citadel Press, 1993). I’ve edited it just slightly here.

Actually, the objects the DBMS has to deal with are never relations in SQL!—except in the very special case in which the object in question is an SQL table with (a) just one column (and that column is properly named), (b) no duplicate rows, and (c) no nulls. Moreover, to comply with the prescriptions of the relational model, they should also (d) contain no pointers (see the answer to Exercise 2.2h in Chapter 2).

C. J. DateDatabase Design and Relational Theoryhttps://doi.org/10.1007/978-1-4842-5540-7_2

2. Prerequisites

C. J. Date¹

(1)

Healdsburg, California, USA

The world is everything that is the case.

—Ludwig Wittgenstein: Tractatus Logico-Philosophicus (1921)

You’re supposed to be a database professional, by which I mean someone who (a) is a database practitioner and (b) has a reasonable degree of familiarity with relational theory. Please note that—I’m sorry to have to say this, but it’s true—a knowledge of SQL, no matter how deep, is not sufficient to satisfy part (b) of this requirement. As I said in SQL and Relational Theory:

I’m sure you know something about SQL; but—and I apologize for the possibly offensive tone here—if your knowledge of the relational model derives only from your knowledge of SQL, then I’m afraid you won’t know the relational model as well as you should, and you’ll probably know some things that ain’t so. I can’t say it too strongly: SQL and the relational model aren’t the same thing.

The purpose of this chapter, then, is to tell you some things that I hope you already know. If you do, then the chapter will serve as a refresher; if you don’t, then I hope it’ll serve as an adequate tutorial. More specifically, what I want to do is spell out in some detail certain fundamental aspects of relational theory that I’ll be relying on heavily in the pages ahead. The aspects in question are ones that, in my experience, database practitioners often aren’t aware of (at least, not explicitly). Of course, there are other aspects of relational theory I’ll be relying on as well, but I’ll elaborate on those, if I think it necessary, when I come to make use of them.

Overview

Let me begin by giving a quick summary, mainly just for purposes of subsequent reference, of those fundamental aspects of relational theory just mentioned:

Any given database consists of a set of relation variables (relvars for short).

The value of any given relvar at any given time is a relation value (relation for short).

Every relvar represents a certain predicate (the relvar predicate).

Within any given relvar, every tuple represents a certain proposition.

Relvar R at time T contains all and only those tuples that represent instantiations of the predicate corresponding to relvar R that evaluate to TRUE at time T.

The next two sections (which are heavily based on material from SQL and Relational Theory) elaborate on these ideas.

Relations and Relvars

Take another look at Figure 1-1, the suppliers-and-parts database, in Chapter 1. That figure shows three relations: namely, the relations that happen to exist in the database at some particular time. But if we were to look at the same database at some different time, we would probably see three different relations appearing in their place. In other words, S, P, and SP are really variables—relation variables, to be precise—and just like variables in general, they have different

Enjoying the preview?

Page 1 of 1

Database Design and Relational Theory: Normal Forms and All That Jazz

About this ebook

C. J. Date

Related authors

Related to Database Design and Relational Theory

Related ebooks

Beginning Oracle Database 12c Administration: From Novice to Professional

Oracle DBA Mentor: Succeeding as an Oracle Database Administrator

Data Teams: A Unified Management Model for Successful Data-Focused Teams

Exploring C++20: The Programmer's Introduction to C++

SQL Server 2019 Revealed: Including Big Data Clusters and Machine Learning

Complete Guide to Test Automation: Techniques, Practices, and Patterns for Building and Maintaining Effective Software Projects

Scalable Big Data Architecture: A practitioners guide to choosing relevant Big Data architecture

The SQL Server DBA’s Guide to Docker Containers: Agile Deployment without Infrastructure Lock-in

Assessing and Improving Prediction and Classification: Theory and Algorithms in C++

Pro Machine Learning Algorithms: A Hands-On Approach to Implementing Algorithms in Python and R

Numerical Python: Scientific Computing and Data Science Applications with Numpy, SciPy and Matplotlib

Developing Applications with Azure Active Directory: Principles of Authentication and Authorization for Architects and Developers

Applied Natural Language Processing with Python: Implementing Machine Learning and Deep Learning Algorithms for Natural Language Processing

MongoDB Recipes: With Data Modeling and Query Building Strategies

Data Modeling Fundamentals: A Practical Guide for IT Professionals

Understanding Azure Data Factory: Operationalizing Big Data and Advanced Analytics Solutions

Web App Development and Real-Time Web Analytics with Python: Develop and Integrate Machine Learning Algorithms into Web Apps

Architecting CSS: The Programmer’s Guide to Effective Style Sheets

Practical API Architecture and Development with Azure and AWS: Design and Implementation of APIs for the Cloud

Data Engineering with Apache Spark, Delta Lake, and Lakehouse: Create scalable pipelines that ingest, curate, and aggregate complex data in a timely and secure way

Generating a New Reality: From Autoencoders and Adversarial Networks to Deepfakes

Deep Belief Nets in C++ and CUDA C: Volume 1: Restricted Boltzmann Machines and Supervised Feedforward Networks

Building REST APIs with Flask: Create Python Web Services with MySQL

Learning Azure DocumentDB

Beginning SQL Server Reporting Services

Pro SQL Server Internals

Machine Learning with PySpark: With Natural Language Processing and Recommender Systems

Foundations of Python Network Programming

Python for Marketing Research and Analytics

Dynamic SQL: Applications, Performance, and Security in Microsoft SQL Server

Databases For You

SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL

Behind Every Good Decision: How Anyone Can Use Business Analytics to Turn Data into Profitable Insight

Access 2010 All-in-One For Dummies

Summary of Building a Second Brain: by Tiago Forte - A Proven Method to Organize Your Digital Life and Unlock Your Creative Potential - A Comprehensive Summary

Access 2019 For Dummies

Learn Git in a Month of Lunches

Grokking Algorithms: An illustrated guide for programmers and other curious people

Practical Data Analysis

Learn SQL in 24 Hours

SQL Programming & Database Management For Absolute Beginners SQL Server, Structured Query Language Fundamentals: "Learn - By Doing" Approach And Master SQL

Access 2016 For Dummies

COMPUTER SCIENCE FOR ROOKIES

Blockchain Basics: A Non-Technical Introduction in 25 Steps

Go in Action

LINUX: Beginner's Crash Course. Your Step-By-Step Guide To Learning The Linux Operating System And Command Line Easy & Fast!

Deep Learning with Python: A Comprehensive guide to Building and Training Deep Neural Networks using Python and popular Deep Learning Frameworks

PostgreSQL Administration Essentials

Learn SQL Server Administration in a Month of Lunches

Managing Data Using Excel

IMS-DB Basic Training For Application Developers

Visualizing Graph Data

Schaum’s Outline of Fundamentals of SQL Programming

The Data Model Resource Book: Volume 3: Universal Patterns for Data Modeling

Data Analysis with R

Base SAS Interview Questions You'll Most Likely Be Asked

Starting Database Administration: Oracle DBA

Practical Data Science with R, Second Edition

Access for Beginners: Access Essentials, #1

Python and SQLite Development

Serverless Architectures on AWS, Second Edition

Related podcast episodes

Related articles

Related categories

Reviews for Database Design and Relational Theory

What did you think?

Book preview

Database Design and Relational Theory - C. J. Date

1. Preliminaries

Some Quotes from the Literature

A Note on Terminology

The Running Example

Keys

The Place of Design Theory