Database model

A database model is a type of data model that determines the logical structure of a database. It fundamentally determines in which manner data can be stored, organized and manipulated. The most popular example of a database model is the relational model, which uses a table-based format.

Types

Common

logical data models

for databases include:

Hierarchical database model

This is the oldest form of database model. It was developed by IBM for IMS (information Management System), and is a set of organized data in tree structure. DB record is a tree consisting of many groups called segments. It uses one-to-many relationships, and the data access is also predictable.

An object–relational database combines the two related structures.

Physical data models

include:

Inverted index
Flat file

Other models include:

Multidimensional model
Multivalue model
Semantic model
XML database
Named graph
Triplestore

Relationships and functions

A given database management system may provide one or more models. The optimal structure depends on the natural organization of the application's data, and on the application's requirements, which include transaction rate (speed), reliability, maintainability, scalability, and cost. Most

database management systems

are built around one particular data model, although it is possible for products to offer support for more than one model.

Various

physical data models

can implement any given logical model. Most database software will offer the user some level of control in tuning the physical implementation, since the choices that are made have a significant effect on performance.

A model is not just a way of structuring data: it also defines a set of operations that can be performed on the data.^[1] The relational model, for example, defines operations such as select (project) and join. Although these operations may not be explicit in a particular query language, they provide the foundation on which a query language is built.

Flat model

The

flat (or table) model consists of a single, two-dimensional array of data

elements, where all members of a given column are assumed to be similar values, and all members of a row are assumed to be related to one another. For instance, columns for name and password that might be used as a part of a system security database. Each row would have the specific password associated with an individual user. Columns of the table often have a type associated with them, defining them as character data, date or time information, integers, or floating point numbers. This tabular format is a precursor to the relational model.

Early data models

These models were popular in the 1960s, 1970s, but nowadays can be found primarily in old legacy systems. They are characterized primarily by being navigational with strong connections between their logical and physical representations, and deficiencies in data independence.

Hierarchical model

In a

Information Management System (IMS) by IBM, and now describe the structure of XML

documents. This structure allows one-to-many relationship between two types of data. This structure is very efficient to describe many relationships in the real world; recipes, table of contents, ordering of paragraphs/verses, any nested and sorted information.

This hierarchy is used as the physical order of records in storage. Record access is done by navigating downward through the data structure using pointers combined with sequential accessing. Because of this, the hierarchical structure is inefficient for certain database operations when a full path (as opposed to upward link and sort field) is not also included for each record. Such limitations have been compensated for in later IMS versions by additional logical hierarchies imposed on the base physical hierarchy.

Network model

The

network model expands upon the hierarchical structure, allowing many-to-many relationships in a tree-like structure that allows multiple parents. It was most popular before being replaced by the relational model, and is defined by the CODASYL

specification.

The network model organizes data using two fundamental concepts, called records and sets. Records contain fields (which may be organized hierarchically, as in the programming language COBOL). Sets (not to be confused with mathematical sets) define one-to-many relationships between records: one owner, many members. A record may be an owner in any number of sets, and a member in any number of sets.

A set consists of circular linked lists where one record type, the set owner or parent, appears once in each circle, and a second record type, the subordinate or child, may appear multiple times in each circle. In this way a hierarchy may be established between any two record types, e.g., type A is the owner of B. At the same time another set may be defined where B is the owner of A. Thus all the sets comprise a general directed graph (ownership defines a direction), or network construct. Access to records is either sequential (usually in each record type) or by navigation in the circular linked lists.

The network model is able to represent redundancy in data more efficiently than in the hierarchical model, and there can be more than one path from an ancestor node to a descendant. The operations of the network model are navigational in style: a program maintains a current position, and navigates from one record to another by following the relationships in which the record participates. Records can also be located by supplying key values.

Although it is not an essential feature of the model, network databases generally implement the set relationships by means of pointers that directly address the location of a record on disk. This gives excellent retrieval performance, at the expense of operations such as database loading and reorganization.

Popular DBMS products that utilized it were Cincom Systems' Total and Cullinet's IDMS. IDMS gained a considerable customer base; in the 1980s, it adopted the relational model and SQL in addition to its original tools and languages.

Most object databases (invented in the 1990s) use the navigational concept to provide fast navigation across networks of objects, generally using object identifiers as "smart" pointers to related objects. Objectivity/DB, for instance, implements named one-to-one, one-to-many, many-to-one, and many-to-many named relationships that can cross databases. Many object databases also support SQL, combining the strengths of both models.

Inverted file model

In an inverted file or

database indexes

, which might only use the contents from a particular columns in the lookup table. The inverted file data model can put indexes in a set of files next to existing flat database files, in order to efficiently directly access needed records in these files.
Notable for using this data model is the ADABAS DBMS of Software AG, introduced in 1970. ADABAS has gained considerable customer base and exists and supported until today. In the 1980s it has adopted the relational model and SQL in addition to its original tools and languages.
Document-oriented database Clusterpoint uses inverted indexing model to provide fast full-text search for XML or JSON data objects for example.

Relational model

Two tables with a relationship

Main article: Relational model

The
predicate logic and set theory
, and implementations of it have been used by mainframe, midrange and microcomputer systems.
The products that are generally referred to as relational databases in fact implement a model that is only an approximation to the mathematical model defined by Codd. Three key terms are used extensively in relational database models: relations, attributes, and domains. A relation is a table with columns and rows. The named columns of the relation are called attributes, and the domain is the set of values the attributes are allowed to take.
The basic data structure of the relational model is the table, where information about a particular entity (say, an employee) is represented in rows (also called tuples) and columns. Thus, the "relation" in "relational database" refers to the various tables in the database; a relation is a set of tuples. The columns enumerate the various attributes of the entity (the employee's name, address or phone number, for example), and a row is an actual instance of the entity (a specific employee) that is represented by the relation. As a result, each tuple of the employee table represents various attributes of a single employee.
All relations (and, thus, tables) in a relational database have to adhere to some basic rules to qualify as relations. First, the ordering of columns is immaterial in a table. Second, there can not be identical tuples or rows in a table. And third, each tuple will contain a single value for each of its attributes.
A relational database contains multiple tables, each similar to the one in the "flat" database model. One of the strengths of the relational model is that, in principle, any value occurring in two different records (belonging to the same table or to different tables), implies a relationship among those two records. Yet, in order to enforce explicit
integrity constraints
, relationships between records in tables can also be defined explicitly, by identifying or non-identifying parent-child relationships characterized by assigning cardinality (1:1, (0)1:M, M:M). Tables can also have a designated single attribute or a set of attributes that can act as a "key", which can be used to uniquely identify each tuple in the table.
A key that can be used to uniquely identify a row in a table is called a primary key. Keys are commonly used to join or combine data from two or more tables. For example, an Employee table may contain a column named Location which contains a value that matches the key of a Location table. Keys are also critical in the creation of indexes, which facilitate fast retrieval of data from large tables. Any column can be a key, or multiple columns can be grouped together into a compound key. It is not necessary to define all the keys in advance; a column can be used as a key even if it was not originally intended to be one.
A key that has an external, real-world meaning (such as a person's name, a book's
social security number
, except when the social security numbers are incorrect, missing, or have changed.)
The most common query language used with the relational model is the Structured Query Language (SQL).

Dimensional model

The
OLAP
queries. In the dimensional model, a database schema consists of a single large table of facts that are described using dimensions and measures. A dimension provides the context of a fact (such as who participated, when and where it happened, and its type) and is used in queries to group related facts together. Dimensions tend to be discrete and are often hierarchical; for example, the location might include the building, state, and country. A measure is a quantity describing the fact, such as revenue. It is important that measures can be meaningfully aggregated—for example, the revenue from different locations can be added together.
In an OLAP query, dimensions are chosen and the facts are grouped and aggregated together to create a summary.
The dimensional model is often implemented on top of the relational model using a star schema, consisting of one highly normalized table containing the facts, and surrounding denormalized tables containing each dimension. An alternative physical implementation, called a snowflake schema, normalizes multi-level hierarchies within a dimension into multiple tables.
A data warehouse can contain multiple dimensional schemas that share dimension tables, allowing them to be used together. Coming up with a standard set of dimensions is an important part of dimensional modeling.
Its high performance has made the dimensional model the most popular database structure for OLAP.

Post-relational database models

Products offering a more general data model than the relational model are sometimes classified as post-relational.
E.F. Codd
's Information Principle, which requires that

all information in the database must be cast explicitly in terms of values in relations and in no other way
— ^[4]

Some of these extensions to the relational model integrate concepts from technologies that pre-date the relational model. For example, they allow representation of a directed graph with
GraphDB
.
Some post-relational products extend relational systems with non-relational features. Others arrived in much the same place by adding relational features to pre-relational systems. Paradoxically, this allows products that are historically pre-relational, such as PICK and MUMPS, to make a plausible claim to be post-relational.
The resource space model (RSM) is a non-relational data model based on multi-dimensional classification.^[5]

Graph model

Main article: Graph database

Graph databases allow even more general structure than a network database; any node may be connected to any other node.

Multivalue model

Main article:
MultiValue

Multivalue databases are "lumpy" data, in that they can store exactly the same way as relational databases, but they also permit a level of depth which the relational model can only approximate using sub-tables. This is nearly identical to the way XML expresses data, where a given field/attribute can have multiple right answers at the same time. Multivalue can be thought of as a compressed form of XML.
An example is an invoice, which in either multivalue or relational data could be seen as (A) Invoice Header Table - one entry per invoice, and (B) Invoice Detail Table - one entry per line item. In the multivalue model, we have the option of storing the data as on table, with an embedded table to represent the detail: (A) Invoice Table - one entry per invoice, no other tables needed.
The advantage is that the atomicity of the Invoice (conceptual) and the Invoice (data representation) are one-to-one. This also results in fewer reads, less referential integrity issues, and a dramatic decrease in the hardware needed to support a given transaction volume.

Object-oriented database models

Example of an object-oriented model

Main articles:
Object–relational model and Object model

In the 1990s, the
encapsulation and polymorphism
, into the world of databases.
A variety of these ways have been tried ^[
which?
] have attacked the problem from the database end, by defining an object-oriented data model for the database, and defining a database programming language that allows full programming capabilities as well as traditional query facilities.
Object databases suffered because of a lack of standardization: although standards were defined by
ODMG, they were never implemented well enough to ensure interoperability between products. Nevertheless, object databases have been used successfully in many applications: usually specialized applications such as engineering databases or molecular biology databases rather than mainstream commercial data processing. However, object database ideas were picked up by the relational vendors and influenced extensions made to these products and indeed to the SQL
language.
An alternative to translating between objects and relational databases is to use an object–relational mapping (ORM) library.

See also

Database design

References

Wikimedia Commons has media related to Database models.

ISBN 9780133970777
.

^ E.F. Codd (1970). "A relational model of data for large shared data banks". In: Communications of the ACM archive. Vol 13. Issue 6(June 1970). pp.377-387.

ISBN 0-17-012731-1
, p. 69.

^ Date, C. J. (June 1, 1999). "When's an extension not an extension?". Intelligent Enterprise. 2 (8).

ISBN 978-0-387-72771-4
.

v
t
e
Database models
Common models

Flat

Hierarchical

Dimensional

Network

Relational

Entity–relationship
Enhanced

Graph

Object-oriented

Entity–attribute–value

Other models

Multi-dimensional

Array

Semantic

Star schema

XML database

Implementations

Flat file

Column-oriented

Document-oriented

Object–relational

Deductive

Temporal
Valid time

Transaction time

Decision time

XML data store

Key–value store

Ordered Key-Value Store

Triplestore

v
t
e
Data model
Main

Architecture

Modeling

Structure

Schemas

Conceptual

Logical

Physical

Types

Database

Data structure diagram

Entity–relationship model (enhanced)

Geographic

Generic

Semantic

Common

Related models

Data-flow diagram

Information model

Object model

Object–role modeling

Unified Modeling Language

See also

Database design

Business process modeling

Core architecture data model

Enterprise modelling

Function model

Process modeling

XML schema

Data Format Description Language

v
t
e
Database
Main

Requirements

Theory

Models

Database management system

Machine

Server

Application

Connection
datasource

DSN

Administrator

Lock

Types

Tools

Languages

Data definition

Data manipulation

Query
information retrieval

Security

Activity monitoring

Audit

Forensics

Negative database

Design

Entities and relationships (and Enhanced notation)

Normalization

Schema

Refactoring

Cardinality

Programming

Abstraction layer

Object–relational mapping

Management

Virtualization

Tuning
caching

Migration

Preservation

Integrity

Lists of

Academic

Biological

Biodiversity

Facial expression

Online

Online music

Online real estate

See also

Database-centric architecture

Intelligent database

Two-phase locking

Locks with ordered sharing

Load file

Publishing

Halloween Problem

Log shipping

WikiProject Category

v
t
e
Database management systems
Types

Object-oriented
comparison

Relational
list

comparison

Key–value

Column-oriented
list

Document-oriented

Wide-column store

Graph

NoSQL

NewSQL

In-memory
list

Multi-model
comparison

Cloud

Blockchain-based database

Concepts

Database

ACID

Armstrong's axioms

Codd's 12 rules

CAP theorem

CRUD

Null

Candidate key

Foreign key

Superkey

Surrogate key

Unique key

Objects

Relation
table

column

row

View

Transaction

Transaction log

Trigger

Index

Stored procedure

Cursor

Partition

Components

Concurrency control

Data dictionary

JDBC

XQJ

ODBC

Query language

Query optimizer

Query rewriting system

Query plan

Functions

Administration

Query optimization

Replication

Sharding

Related topics

Database models

Database normalization

Database storage

Distributed database

Federated database system

Referential integrity

Relational algebra

Relational calculus

Relational model

Object–relational database

Transaction processing

Category

Outline

WikiProject

v
t
e
Software engineering
Fields

Computer programming

DevOps

Empirical software engineering

Experimental software engineering

Formal methods

Requirements engineering

Search-based software engineering

Site reliability engineering

Social software engineering

Software deployment

Software design

Software maintenance

Software testing

Systems analysis

Concepts

Abstraction

Component-based software engineering

Software compatibility
Backward compatibility

Compatibility layer

Compatibility mode

Forward compatibility

Software incompatibility

Data modeling

Enterprise architecture

Functional specification

Modeling language

Programming paradigm

Software

Software archaeology

Software architecture

Software configuration management

Software development process/methodology

Software quality

Software quality assurance

Software verification and validation

Software system

Structured analysis
Essential analysis

CI/CD

Orientations

Agile

Aspect-oriented

Object orientation

Ontology

Service orientation

SDLC

Models
Developmental

Agile

EUP

Executable UML

Incremental model

Iterative model

Prototype model

RAD

UP

Scrum

Spiral model

V-model

Waterfall model

XP

Model-driven engineering

Round-trip engineering

Other

SPICE

CMMI

Data model

ER model

Function model

Information model

Metamodeling

Object model

Systems model

View model

Languages

IDEF

UML

USL

SysML

Related fields

Computer science

Computer engineering

Information science

Project management

Risk management

Systems engineering

Commons

Category

Retrieved from "https://en.wikipedia.org/w/index.php?title=Database_model&oldid=1205889225"

[Elmasri-1] ISBN 9780133970777
.

[2] E.F. Codd (1970). "A relational model of data for large shared data banks". In: Communications of the ACM archive. Vol 13. Issue 6(June 1970). pp.377-387.

[CONR-3] ISBN 0-17-012731-1
, p. 69.

[4] Date, C. J. (June 1, 1999). "When's an extension not an extension?". Intelligent Enterprise. 2 (8).

[5] ISBN 978-0-387-72771-4
.

[1]

[4]

[5]