Glossary

Applications programs (‘application’, or ‘app’, for short) are computer programs designed to perform a group of functions or tasks for a user’s benefit. Word processors, media players and web browsers are some examples of applications.

Also called ‘BI’. The analysis of an organisation’s raw data in order to gain actionable business insight.

Big Data is the term used to identify data sets that are so large that traditional data processing methods do not suffice. When analysed accurately, Big Data can support more accurate decision-making within an enterprise.

Also called CRM, this refers to the methods and technologies that companies use to manage and analyse customer interactions in order to improve those interactions and increase customer retention and profitability.

Analytics is the process by which meaningful patterns are uncovered in data. This is often achieved by applying statistical methods to large, detailed datasets using computer software. The output from data analytics is typically presented visually using specialist data visualisation tools.

The combining of data residing in different sources to provide a unified view of that data.

The development and execution of plans, programs, and practices that control, protect and deliver stored data.

The process of transferring data between storage systems, formats, or computer systems.

A field of computer science that involves discovering patterns in large data sets and transforming that data into usable information.

The assessment of data to determine its fitness to serve its given purpose.

A form of direct marketing that uses databases of customers or potential customers to generate personalised, targeted communications.

A central repository of integrated data from one or more sources, both current and historical, for reporting and analysis.

Extract, Transform, and Load; The extracting of data from one or many sources, the transformation of it into a standard format or structure, and the loading of it into a final target.

A standardised model displaying a detailed representation of an organisation’s data, standardising attributes and the relationships between them.

A field of computer science that explores the construction of algorithms that can learn from, and make predictions on, data.

A central computer (usually supplied by IBM) in a large organisation, used primarily for critical applications and block data processing.

A large number of processors, or separate computers, perform a set of computations in parallel. MPP systems are linearly scalable, in that doubling the number of processors doubles the processing power, examples include Teradata, Netezza and Greenplum.

The method of enabling an organisation to link all of its critical data to one file as a single point of reference.

The improvement of a system’s performance, increasing its ability to handle a higher workload, or to execute the existing workload in a shorter timeframe.

Greenplum is MPP relational database management software used to manage data warehouses. Greenplum can be deployed as an appliance, on a customer’s preferred server/storage hardware or via the public cloud.

The translating of the logical data model into technology specific SQL statements that define a database.

A branch of advanced analytics focussed on making predictions about the future based on historical data, using techniques such as data mining.

Often described as consultants, firms who offer professional services provide specialised knowledge in a specific field.

A RDMS is a computer application that interacts with users, other applications and the database itself to capture and analyse data, based on the relational model. Examples include Teradata, Oracle, SQL Server, DB2 and Greenplum.

Also called SMP. The processing of programs by multiple processors sharing a common operating system and memory. These shared features affect the scalability of SMP.

Teradata is an MPP relational database management system used to manage data warehouses, which can be deployed as an appliance, as a private cloud or via the public cloud.

The highest qualification available in Teradata.

A very large database, or VLDB, is a database that contains an extremely high number of database rows, or occupies an extremely large physical filesystem storage space.