Surrogate Keys vs Natural Keys for Primary Key?


Posted by decipherinfosys on February 1, 2007

This is probably one of those topics on which no two database developers or DBAs will agree. Everyone has an opinion about it, and it is one of the most discussed data modeling topics on the web. Rather than taking any side :-), we are just listing our experiences in choosing between a surrogate key and a natural key for a table.

Surrogate Key:

Surrogate keys are keys that have no “business” meaning and are used solely to identify a record in the table. Such keys are either database generated (for example: Identity in SQL Server, Sequence in Oracle, Sequence/Identity in DB2 UDB) or system generated (for example, values handed out by a next-up table in the schema).

Natural Key:

A key is natural if the attribute it represents is used for identification independently of the database schema. In other words, a key is natural if people already use it to identify things, for example: invoice numbers, tax IDs, SSNs.
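To make the two definitions concrete, here is a minimal sketch in Python using the built-in sqlite3 module. The invoice table and its column names are hypothetical, and SQLite's AUTOINCREMENT merely stands in for Identity or a Sequence in the products named above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical table: invoice_id is the surrogate key (generated by the
# database, no business meaning); invoice_number is the natural key
# (the value people actually use to refer to an invoice).
conn.execute("""
    CREATE TABLE invoice (
        invoice_id     INTEGER PRIMARY KEY AUTOINCREMENT,
        invoice_number TEXT NOT NULL UNIQUE,
        amount         REAL NOT NULL
    )
""")

# The application never supplies invoice_id; the database assigns it.
for number, amount in [("INV-2007-0001", 125.50), ("INV-2007-0002", 80.00)]:
    conn.execute(
        "INSERT INTO invoice (invoice_number, amount) VALUES (?, ?)",
        (number, amount),
    )

rows = conn.execute(
    "SELECT invoice_id, invoice_number FROM invoice ORDER BY invoice_id"
).fetchall()
print(rows)
```

Note that even with the surrogate as the PK, the natural key keeps its own unique constraint so that duplicate invoice numbers are still rejected.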

Design considerations for choosing the Primary Key:

Primary Key should meet the following requirements:

It should be not null and unique, and it should apply to all rows.
It should be minimal, i.e. have as few columns as possible: ideally one. If you use a composite key, make sure that its columns are surrogates with integer family data-types.
It should be stable over time, i.e. updates to the PK columns should not happen.
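The first requirement can be checked mechanically against sample data. The sketch below is only an illustration: the is_candidate_key helper and the customer rows are made up for this example. Stability cannot be verified from a snapshot of the data; that takes knowledge of the business.

```python
def is_candidate_key(rows, columns):
    """Return True if the given columns are non-null and unique in every row."""
    seen = set()
    for row in rows:
        key = tuple(row[c] for c in columns)
        if any(value is None for value in key):
            return False  # a PK column may not be null
        if key in seen:
            return False  # a PK value may not repeat
        seen.add(key)
    return True


# Hypothetical sample data.
customers = [
    {"tax_id": "11-111", "email": "a@example.com", "name": "Ann"},
    {"tax_id": "22-222", "email": None, "name": "Bob"},
    {"tax_id": "33-333", "email": "c@example.com", "name": "Ann"},
]

print(is_candidate_key(customers, ["tax_id"]))  # unique and never null
print(is_candidate_key(customers, ["email"]))   # contains a null
print(is_candidate_key(customers, ["name"]))    # contains a duplicate
```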

Keeping these in mind, here are the pros and cons of Surrogate vs. Natural keys:

Surrogate Key

I prefer surrogate keys to be DB controlled rather than controlled via a next-up table in the schema, since DB controlled generation is the more scalable approach.
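For readers who have not seen one, a next-up table looks roughly like the sketch below (Python with sqlite3; the next_up table and the next_id function are hypothetical names). Every session that needs a key has to update the same row, which is why this approach tends to serialize inserts under load.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical next-up table: one row per table that needs generated keys.
conn.execute(
    "CREATE TABLE next_up (table_name TEXT PRIMARY KEY, next_value INTEGER NOT NULL)"
)
conn.execute("INSERT INTO next_up VALUES ('invoice', 0)")
conn.commit()


def next_id(conn, table_name):
    """Reserve and return the next key value for table_name."""
    with conn:  # one transaction: commits on success, rolls back on error
        conn.execute(
            "UPDATE next_up SET next_value = next_value + 1 WHERE table_name = ?",
            (table_name,),
        )
        row = conn.execute(
            "SELECT next_value FROM next_up WHERE table_name = ?",
            (table_name,),
        ).fetchone()
    if row is None:
        raise KeyError(f"no next-up row for {table_name!r}")
    return row[0]


first = next_id(conn, "invoice")
second = next_id(conn, "invoice")
print(first, second)
```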

Pros:

Business Logic is not in the keys.
Small key: the surrogate key will most likely be an integer, which in SQL Server, for example, needs only 4 bytes of storage (8 bytes for a bigint).
Joins on such a small integer key are very fast.
No locking contention because of a unique constraint (this refers to the waits that develop when two sessions try to insert the same unique business key), since the surrogates are generated by the DB and are cached, which is very scalable.

Cons:

An additional index is needed. In SQL Server, a PK constraint always creates a unique index; in Oracle, if a suitable index already exists, PK creation uses that index for uniqueness enforcement (so this is not a con in Oracle).
Cannot be used as a search key, since the value means nothing to users.
If it is database controlled, products that support multiple databases need a different implementation for each, for example: identity in SQL Server 2000, BEFORE triggers and sequences in Oracle, identity/sequence in DB2 UDB.
Always requires a join when browsing the child table(s).
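The last con is easiest to see side by side. In this sketch (Python with sqlite3; all table and column names are hypothetical), customer_s references the parent by its surrogate key and customer_n references it by its natural key. Showing the country code for a customer needs a join in the first case and none in the second.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE country (
        country_id INTEGER PRIMARY KEY,      -- surrogate key
        iso_code   TEXT NOT NULL UNIQUE      -- natural key
    );
    -- Child table that carries the surrogate key of the parent.
    CREATE TABLE customer_s (
        customer_id INTEGER PRIMARY KEY,
        name        TEXT NOT NULL,
        country_id  INTEGER NOT NULL REFERENCES country(country_id)
    );
    -- Child table that carries the natural key of the parent.
    CREATE TABLE customer_n (
        customer_id  INTEGER PRIMARY KEY,
        name         TEXT NOT NULL,
        country_code TEXT NOT NULL REFERENCES country(iso_code)
    );
    INSERT INTO country VALUES (1, 'US'), (2, 'DE');
    INSERT INTO customer_s VALUES (1, 'Ann', 2);
    INSERT INTO customer_n VALUES (1, 'Ann', 'DE');
""")

# Surrogate foreign key: the readable value lives in the parent table.
with_join = conn.execute("""
    SELECT c.name, k.iso_code
    FROM customer_s AS c
    JOIN country AS k ON k.country_id = c.country_id
""").fetchall()

# Natural foreign key: the readable value is already in the child row.
without_join = conn.execute(
    "SELECT name, country_code FROM customer_n"
).fetchall()

print(with_join, without_join)
```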

Natural Key

Pros:

No additional Index.
Can be used as a search key.

Cons:

If not chosen wisely (business meaning in the key(s)), additions to the PK may be required over time, and modifications to the PK can occur.
If using strings, joins are a bit slower than joins on int data-types, and storage is larger as well. Since storage is larger, fewer data values fit on each index page. Also, reading strings is a two-step process in some RDBMS: one step to get the actual length of the string and a second to read the value.
Locking contention can arise if an application-driven generation mechanism is used for the key.
A record cannot be entered until the value is known, since the value has some meaning.
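The first con deserves a quick illustration. When a natural key that was assumed to be stable does change, the change has to reach every child row that references it. A minimal sketch in Python with sqlite3 follows; the product and order_line tables are hypothetical, and ON UPDATE CASCADE is just one way of propagating the change.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces FKs only when enabled

conn.executescript("""
    CREATE TABLE product (
        sku  TEXT PRIMARY KEY,               -- natural key used as the PK
        name TEXT NOT NULL
    );
    CREATE TABLE order_line (
        order_line_id INTEGER PRIMARY KEY,
        sku           TEXT NOT NULL
                      REFERENCES product(sku) ON UPDATE CASCADE,
        qty           INTEGER NOT NULL
    );
    INSERT INTO product VALUES ('WID-01', 'Widget');
    INSERT INTO order_line (sku, qty) VALUES ('WID-01', 3), ('WID-01', 5);
""")

# The business renames the SKU: one parent update rewrites every child row.
conn.execute("UPDATE product SET sku = 'WIDGET-001' WHERE sku = 'WID-01'")

child_skus = [
    row[0]
    for row in conn.execute("SELECT sku FROM order_line ORDER BY order_line_id")
]
print(child_skus)
```

With a surrogate PK, the same rename would touch a single column in a single row of the parent table.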

Choosing Surrogate vs. Natural Keys:

There is no rule of thumb in this case. It has to be evaluated table by table:

If we can identify an appropriate natural key that meets the three criteria for a PK column, we should use it. Look-up tables and configuration tables are typically good candidates.
Data-Type for the PK: the smaller the better; choose an integer or a short character data type. This also makes the joins faster. It becomes even more important if the PK is going to be the clustered index, since non-clustered indexes are built off the clustered index and carry its key. An RDBMS generally processes integer values faster than character values, because character comparisons take extra work: strings are compared character by character according to the collation rules.
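A back-of-the-envelope calculation shows why key size matters for index pages. This is only a rough model: the usable page size assumes an 8 KB page minus a 96-byte header, the per-entry overhead is a made-up figure, and real engines differ in both.

```python
USABLE_PAGE_BYTES = 8096   # assumption: 8 KB page minus a 96-byte header
ENTRY_OVERHEAD_BYTES = 7   # assumption: hypothetical per-entry overhead


def keys_per_page(key_bytes):
    """Estimate how many index entries of the given key size fit on one page."""
    return USABLE_PAGE_BYTES // (key_bytes + ENTRY_OVERHEAD_BYTES)


for label, size in [("int", 4), ("bigint", 8), ("char(20)", 20)]:
    print(f"{label:>8}: about {keys_per_page(size)} keys per page")
```

Under these assumptions an int key fits more than twice as many entries per page as a 20-byte character key, which means fewer pages to read for the same number of rows.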

This entry was posted on February 1, 2007 at 1:36 am and is filed under Data Model.

