Normalization
If you've been working with databases for a while, chances are you've heard the term normalization. Perhaps someone's asked you "Is that database normalized?" or "Is that in BCNF?" All too often, the reply is "Uh, yeah." Normalization is often brushed aside as a luxury that only academics have time for. However, knowing the principles of normalization and applying them to your daily database design tasks really isn't all that complicated and it could drastically improve the performance of your DBMS.
In this article, we'll introduce the concept of normalization and take a brief look at the most common normal forms. Future articles will provide in-depth explorations of the normalization process.
Normalization is basically used to remove three types of anomalies in the database.These anomalies are
1) Insert
2) Update
3) Delete
we will discuss about all these anomalies in the later post.
Before we begin our discussion of the normal forms, it's important to point out that they are guidelines and guidelines only. Occasionally, it becomes necessary to stray from them to meet practical business requirements. However, when variations take place, it's extremely important to evaluate any possible ramifications they could have on your system and account for possible inconsistencies. That said, let's explore the normal forms.
In first normal form we remove the multivalued dependencies.
In the above mentioned table , employee coloumn contains multivalues , so for first normal form we remove this dependency by doing following steps
After creating the two above tables we had remove multivalued dependies.
It is in the First normal form, and
No partial dependency exists between non-key attributes and key attributes.
An attribute of a relation R that belongs to the candidate key of R is said to be a key attribute and that which doesn’t is a non-key attribute
In this article, we'll introduce the concept of normalization and take a brief look at the most common normal forms. Future articles will provide in-depth explorations of the normalization process.
What is Normalization?
Normalization is the process of efficiently organizing data in a database. There are two goals of the normalization process: eliminating redundant data (for example, storing the same data in more than one table) and ensuring data dependencies make sense (only storing related data in a table). Both of these are worthy goals as they reduce the amount of space a database consumes and ensure that data is logically stored.Normalization is basically used to remove three types of anomalies in the database.These anomalies are
1) Insert
2) Update
3) Delete
we will discuss about all these anomalies in the later post.
The Normal Forms
The database community has developed a series of guidelines for ensuring that databases are normalized. These are referred to as normal forms and are numbered from one (the lowest form of normalization, referred to as first normal form or 1NF) through five (fifth normal form or 5NF). In practical applications, you'll often see 1NF, 2NF, and 3NF along with the occasional 4NF. Fifth normal form is very rarely seen and won't be discussed in this article.Before we begin our discussion of the normal forms, it's important to point out that they are guidelines and guidelines only. Occasionally, it becomes necessary to stray from them to meet practical business requirements. However, when variations take place, it's extremely important to evaluate any possible ramifications they could have on your system and account for possible inconsistencies. That said, let's explore the normal forms.
First Normal Form (1NF)
First normal form (1NF) sets the very basic rules for an organized database:In first normal form we remove the multivalued dependencies.
manager | Employee's> |
Rajeev | Ravi,Deepak,Sunil |
In the above mentioned table , employee coloumn contains multivalues , so for first normal form we remove this dependency by doing following steps
manager | Manager ID> |
Rajeev | 504IT |
manager ID | Employee> |
504IT | Ravi |
504IT | Sunil |
504IT | Deepak |
After creating the two above tables we had remove multivalued dependies.
Second Normal Form (2NF)
A Relation is said to be in Second Normal Form if and only if :It is in the First normal form, and
No partial dependency exists between non-key attributes and key attributes.
An attribute of a relation R that belongs to the candidate key of R is said to be a key attribute and that which doesn’t is a non-key attribute
Third Normal Form (3NF)
Third normal form (3NF) goes one large step further:- Meet all the requirements of the second normal form.
- Remove columns that are not dependent upon the primary key.
Fourth Normal Form (4NF)
Finally, fourth normal form (4NF) has one additional requirement:- Meet all the requirements of the third normal form.
- A relation is in 4NF if it has no multi-valued dependencies.
Comments