BigBear.ai
  • Home
  • Industries
    • Academia
    • Government
    • Healthcare
    • Manufacturing
  • Solutions
    • Cyber
    • Data Analytics
    • Enterprise Planning and Logistics
    • Intelligent Automation
    • Modeling Solutions
    • Professional Services
  • Company
    • About
    • Investor Relations
    • Partners
    • Team
  • Careers
    • Benefits
    • Culture
    • Explore Jobs
    • Military and Veterans
    • Applicant Login
    • Employee Login
  • Resources
    • Blog
    • Events
    • Newsroom
    • Resource Library
    • Online Store
  • Contact
Search

Home Data Warehousing Designing a Relational Data Warehouse

Blog

Designing a Relational Data Warehouse

Jim McHugh
November 17, 2016
  • Share
  • Share

Most applications today are built using relational databases. This relational modeling is based on mathematical set theory. It was first described in 1969 by E.F. Codd who stated that a grouping of elements (tuples) could be related to other tuples in a way that would limit the amount of data stored for any specific tuple and therefore optimize the storage of such data. It’s actually easy to do. Let’s look at an example using information collected about John Smith.

John Quincy Smith
DOB: 12/31/1969
National ID: 123-45-6789
1234 Main Street, Anytown, VA 20001 (h)
8000 Corporate Drive, Suite 1022, Sombertown, VA 20101 USA (w)
[email protected] (w)
[email protected] (p)
[email protected] (p)
(703) 555-1212 (w)
(703) 212-1555 (h)
(571) 234-9876 (c)
1234-5678-9012-3456 (Visa)

Could all of this information be stored in a single table? Yes, absolutely but inefficiently. You can see below what happens when we add a second personal email address column. For every record, regardless of whether there is a work address or second personal email address, additional storage, although minimal, will be allocated.

Another challenge is the need to add additional columns. What if you want to add the ability to choose from more than the stored credit card to pay for Smith’s purchase or you want to add and save multiple shipping addresses? In each instance, you will need to add columns to this table and modify a lot of code to ensure you are choosing the correct credit card and shipping address.

blog004-transaction

The more efficient way is to create a table for each of these logical groupings and continue to break these groupings down to their most basic elements. This will provide you with the greatest amount of flexibility using the least amount of storage space.

First, begin by creating logical groupings of data.

Personal Address Email Phone Number Credit Card
Name Address Line 1 Email Address Phone Number Credit Card Number
DOB Address Line 2 Email Type Phone Type Credit Card Type
SSN City
State/Province
Postal Code
Country
This is a good start but there are still some repeating values that we will look up, and you could possibly add, delete or change over time.

Address Type Email Type Phone Number Type
Address Type (Home & Work) Email Type (Work & Personal) Phone Type (Work, Home & Cell)
State Province Country Credit Card Type
State Province Name Country Name Credit Card Type (Visa & MasterCard, etc.)
blog004-2nf

Now connect the tables by joining the appropriate IDs.

blog004-3nf

This is a basic relational data model. Each transactional system in your organization most likely will have data models which may look similar to this one.

This transaction-based relational data model essentially does not differ from a relational data warehouse data model. The same principals are used in both instances although the amount of data stored is usually much larger in the data warehouse. One significant difference may be the addition of fields to the tables in the warehouse to identify the source of the data. Using relational modeling techniques to create a data warehouse for some or all of your source application data will create, a subject-oriented, non-volatile, time-variant and integrated data store for your corporate data. This relational data structure is the foundation upon which you can build reporting, and analytic data structures called Data Marts, which we will discuss in a future blog post.

Posted in Data Modeling, Data Warehousing.
BigBear.ai
  • Home
  • Industries
  • Solutions
  • Company
  • Careers
  • Blog
  • Investor Relations
  • Contact
  • Twitter
  • Facebook
  • Linkedin
  • Google My business for BigBear.ai
1-410-312-0885
[email protected]
  • Privacy Policy
  • Terms of Use
  • Accessibility
  • Site Map
© BigBear.ai 2023
We value your privacy
We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept”, you consent to the use of ALL the cookies.
Privacy Policy | Do not sell my personal information
AcceptCookie Settings
Manage Consent

Cookies Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
CookieDurationDescription
cookielawinfo-checkbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
JSESSIONIDsessionThe JSESSIONID cookie is used by New Relic to store a session identifier so that New Relic can monitor session counts for an application.
viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
Functional
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
CookieDurationDescription
__atuvc1 year 1 monthAddThis sets this cookie to ensure that the updated count is seen when one shares a page and returns to it, before the share count cache is updated.
__atuvs30 minutesAddThis sets this cookie to ensure that the updated count is seen when one shares a page and returns to it, before the share count cache is updated.
Analytics
Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
CookieDurationDescription
_ga2 yearsThe _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_NK4L4Q320Q2 yearsThis cookie is installed by Google Analytics.
_gat_gtag_UA_163894009_21 minuteSet by Google to distinguish users.
_gid1 dayInstalled by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
at-randneverAddThis sets this cookie to track page visits, sources of traffic and share counts.
CONSENT2 yearsYouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
uvc1 year 1 monthSet by addthis.com to determine the usage of addthis.com service.
Advertisement
Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
CookieDurationDescription
f5avraaaaaaaaaaaaaaaa_session_sessionbusinesswire.com cookie
loc1 year 1 monthAddThis sets this geolocation cookie to help understand the location of users who share the information.
VISITOR_INFO1_LIVE5 months 27 daysA cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSCsessionYSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devicesneverYouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-idneverYouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
Save & Accept