aster.cloud aster.cloud
  • /
  • Platforms
    • Public Cloud
    • On-Premise
    • Hybrid Cloud
    • Data
  • Architecture
    • Design
    • Solutions
    • Enterprise
  • Engineering
    • Automation
    • Software Engineering
    • Project Management
    • DevOps
  • Programming
    • Learning
  • Tools
  • About
  • /
  • Platforms
    • Public Cloud
    • On-Premise
    • Hybrid Cloud
    • Data
  • Architecture
    • Design
    • Solutions
    • Enterprise
  • Engineering
    • Automation
    • Software Engineering
    • Project Management
    • DevOps
  • Programming
    • Learning
  • Tools
  • About
aster.cloud aster.cloud
  • /
  • Platforms
    • Public Cloud
    • On-Premise
    • Hybrid Cloud
    • Data
  • Architecture
    • Design
    • Solutions
    • Enterprise
  • Engineering
    • Automation
    • Software Engineering
    • Project Management
    • DevOps
  • Programming
    • Learning
  • Tools
  • About
  • Data
  • Engineering

The Definitive Guide To Databases On Google Cloud: Part 1 – Data Modeling Basics

  • aster.cloud
  • April 30, 2022
  • 6 minute read

The next best thing since sliced bread!

As I wake up to the smell of tea and freshly ground cardamom every morning, all I could think of is how nice it would be with some soft buttery sliced bread!
Hmm… if only it wasn’t for the gluten allergy.

Anyway, the one thing that is half as old as sliced bread but twice as good as that is Database! The set of comprehensive application programs that can be leveraged to store, access, manage and update data whilst assuring structure, recovery, security, concurrency and more.


Partner with aster.cloud
for your next big idea.
Let us know here.



From our partners:

CITI.IO :: Business. Institutions. Society. Global Political Economy.
CYBERPOGO.COM :: For the Arts, Sciences, and Technology.
DADAHACKS.COM :: Parenting For The Rest Of Us.
ZEDISTA.COM :: Entertainment. Sports. Culture. Escape.
TAKUMAKU.COM :: For The Hearth And Home.
ASTER.CLOUD :: From The Cloud And Beyond.
LIWAIWAI.COM :: Intelligence, Inside and Outside.
GLOBALCLOUDPLATFORMS.COM :: For The World's Computing Needs.
FIREGULAMAN.COM :: For The Fire In The Belly Of The Coder.
ASTERCASTER.COM :: Supra Astra. Beyond The Stars.
BARTDAY.COM :: Prosperity For Everyone.

Yup exactly 52 years ago, E.F. Codd, the father of Database Management Systems (DBMS) propounded and formalized these as 12 commandments that are in fact 13 in number (starting from 0. I know! right?) that make up a DBMS. We have evolved since the 1960s when we used one database to store and secure information to these modern times where we use one-database-per-stage in the data lifecycle, in fact one database per data stage, type and structure in most cases.

In this blog we are going to discuss the business attributes, technical aspects, design questions, considerations to keep in mind while “Designing the Database Model” (and if you hang on till the end, a simple contest to ensure you have been entertained thoroughly!). And if you ask…

Is there a quick and dirty choice?

The short answer is NO. However your selection of database can be derived based on the answers to a multitude of database business requirement questions including the top 3 mandatory ones below:

1. What is the stage in the data lifecycle?

Starting from Ingestion (landing data from different sources in one place), Storage (staging in a persistent location for use later), Processing (transformational stages for analysis) to Visualization (derive and present insights from the analysis), we need to be aware of the stage in the lifecycle of data for which we are designing / discussing storage requirements.

2. What is the type of data you are bringing in?

There are broadly 3 types of data we would be dealing with that would highly influence the choice of database and storage.

Read More  Introducing Vertical Autoscaling In Streaming Dataflow Prime Jobs

a. Application: that covers transactional and  event based data

b. Live Streaming (Real-time): that covers data from real time sources

c.  Batch: that covers bulk, scheduled interval and  event triggered data

Real-time data is immediate and constantly up to date, the integration of this type of data needs to be carried out at the time of the event. Whereas, Batch data process is scheduled at specific times and amounts.

3. Is your data Structured, Unstructured or Hybrid?

a. Structured data is modeled with rows, columns and are mostly transactional and analytical in nature

b. Unstructured could be anything like images, audio files etc. The amount of unstructured data is much larger than that of structured data, so the methods by which we store such data is more important than ever. This means that companies not taking unstructured data into account could be missing out on a lot of crucial business intelligence.

c.  The semi-structured (hybrid) data are the ones with attributes defined but could vary for each record. The major differentiating factor for each kind of semi-structured data is in the way they are retrieved / accessed

Engineering or architecting, ask the right data questions!

There is always this question around the responsibilities concerning data.

If you design data architecture, manage business and technology requirements around the architecture, involve in design of data extraction, transformation, loading and provide direction to the team for methods of organizing, formats and presentation of data, then you are an Architect.

As an Engineer you create applications and develop solutions to enable data for distribution, processing, analysis and participate in one or more of those activities directly.

But in either case, you are an expert, you need to ask the right questions and need to set the right expectation as you approach the technical aspects of data. It is not always possible to get “the one” solution with these questions below but will definitely help get started and eliminate the mismatches easily right off the start.

Read More  Proactive Database Wellness With Active Assist: Introducing Additional Cloud SQL Recommenders

Volume and Scalability:

  • What is the size of data you are going to be dealing with at the time of design and at each stage in the lifecycle of the data?
  • How much do you expect it to scale with time?

 

Velocity:

  • What is the rate / schedule at which the data needs to be sent and processed?

 

Veracity:

  • What is the variation expected to be seen in the data incoming?

 

Security:

  • How much access restriction does your data need (Row level, object level, fine grained levels of access control), encryption, privacy and compliance?

 

And other most common areas of design consideration are Availability, Resilience, Reliability and Portability.

Choosing the right database

Having assessed all these questions and considerations, the logical next step is to choose from / eliminate from the database types out there.

 

  1. We have the good old Relational Database for the Online Transaction Processing (OLTP) that typically follow normalization rules and Online Analytical Processing (OLAP) that are typically used for Data Mart and Data Warehouse applications. This type requires Structured Query Language to define, manipulate, query and manage.
  2. And then we have the NOSQL database types for the semi-structured i.e. less structured than Relational database.

 

There is no formal model or normalization requirement for this type. Key-Value pair DB, Document DB, Wide Column DB, Graph DB are some types of NOSQL databases. More on each of these technologies to be covered in upcoming episodes, code labs and PoCs of the blog series.

 

Before I go

Phew, you would think that’s it for now. Not quite. Let me leave you on a fun note. All this while we have been talking about the types, requirements, design aspects, database choices, and what not.

Here is a simple exercise to flex your understanding so far:

How would you model a NoSQL solution for an application that needs to query the lineage between individual entities that are represented in pairs? (E.g. If A-B, B-C, C-E, A-E, D-F are the row of records that are in pairs, your application should represent A,B,C,E belonging to one lineage and D,F belonging to another).

Read More  Announcing Sovereign Controls For Google Workspace

Tips for Modeling a NoSQL database:

What are the design questions that come to your mind?

Does NOSQL have a schema?

Sometimes it’s misleading when we hear that NoSQL options are schema-less. They do not have a schema in the same strict way as the relational databases. However they have an underlying structure that is used to store the data. Each of the four main types of NoSQL databases is based on a specific way of storing data. Here is the logic for a data model in each case:

 

  1. Document databases store data in document data type, which is similar to a JSON document. Each document stores pairs of fields and values, with a wide variety of data types and data structures being used as values
  2. Key-Value database items consist of a key and a value, making this the simplest type of database. The data model consists of two parts: a string with some relationship to the data and the data. Data is retrieved using the direct request method (provide the key and get the data) rather than through the use of a query language
  3. Wide-column databases use a table form but in a flexible and scalable way. Each row consists of a key and one or more related columns, which are called column families. Each row’s key-column family can have different numbers of columns and the columns can have different kinds of data. Data is retrieved using a query language. This column structure enables fast aggregation queries
  4. Graph Databases consist of nodes connected by edges. Data items are stored in the nodes and the edges store information about how the nodes are related. Node and relationship information is typically retrieved using specialized query languages, sometimes SQL as well

 

Next Steps…

  • Take a few minutes to think through the exercise mentioned in the above section
  • Also you can go through more details of this reference architecture and database options here: https://cloud.google.com/architecture/data-lifecycle-cloud-platform

 

 

By: Abirami Sukumaran (Developer Relations Engineer)
Source: Google Cloud Blog


For enquiries, product placements, sponsorships, and collaborations, connect with us at [email protected]. We'd love to hear from you!

Our humans need coffee too! Your support is highly appreciated, thank you!

aster.cloud

Related Topics
  • Database Management System
  • Databases
  • Google Cloud
You May Also Like
Getting things done makes her feel amazing
View Post
  • Computing
  • Data
  • Featured
  • Learning
  • Tech
  • Technology

Nurturing Minds in the Digital Revolution

  • April 25, 2025
View Post
  • Engineering
  • Technology

Guide: Our top four AI Hypercomputer use cases, reference architectures and tutorials

  • March 9, 2025
View Post
  • Computing
  • Engineering

Why a decades old architecture decision is impeding the power of AI computing

  • February 19, 2025
View Post
  • Engineering
  • Software Engineering

This Month in Julia World

  • January 17, 2025
View Post
  • Engineering
  • Software Engineering

Google Summer of Code 2025 is here!

  • January 17, 2025
View Post
  • Data
  • Engineering

Hiding in Plain Site: Attackers Sneaking Malware into Images on Websites

  • January 16, 2025
View Post
  • Computing
  • Design
  • Engineering
  • Technology

Here’s why it’s important to build long-term cryptographic resilience

  • December 24, 2024
IBM and Ferrari Premium Partner
View Post
  • Data
  • Engineering

IBM Selected as Official Fan Engagement and Data Analytics Partner for Scuderia Ferrari HP

  • November 7, 2024

Stay Connected!
LATEST
  • college-of-cardinals-2025 1
    The Definitive Who’s Who of the 2025 Papal Conclave
    • May 7, 2025
  • conclave-poster-black-smoke 2
    The World Is Revalidating Itself
    • May 6, 2025
  • oracle-ibm 3
    IBM and Oracle Expand Partnership to Advance Agentic AI and Hybrid Cloud
    • May 6, 2025
  • 4
    Conclave: How A New Pope Is Chosen
    • April 25, 2025
  • Getting things done makes her feel amazing 5
    Nurturing Minds in the Digital Revolution
    • April 25, 2025
  • 6
    AI is automating our jobs – but values need to change if we are to be liberated by it
    • April 17, 2025
  • 7
    Canonical Releases Ubuntu 25.04 Plucky Puffin
    • April 17, 2025
  • 8
    United States Army Enterprise Cloud Management Agency Expands its Oracle Defense Cloud Services
    • April 15, 2025
  • 9
    Tokyo Electron and IBM Renew Collaboration for Advanced Semiconductor Technology
    • April 2, 2025
  • 10
    IBM Accelerates Momentum in the as a Service Space with Growing Portfolio of Tools Simplifying Infrastructure Management
    • March 27, 2025
about
Hello World!

We are aster.cloud. We’re created by programmers for programmers.

Our site aims to provide guides, programming tips, reviews, and interesting materials for tech people and those who want to learn in general.

We would like to hear from you.

If you have any feedback, enquiries, or sponsorship request, kindly reach out to us at:

[email protected]
Most Popular
  • 1
    Tariffs, Trump, and Other Things That Start With T – They’re Not The Problem, It’s How We Use Them
    • March 25, 2025
  • 2
    IBM contributes key open-source projects to Linux Foundation to advance AI community participation
    • March 22, 2025
  • 3
    Co-op mode: New partners driving the future of gaming with AI
    • March 22, 2025
  • 4
    Mitsubishi Motors Canada Launches AI-Powered “Intelligent Companion” to Transform the 2025 Outlander Buying Experience
    • March 10, 2025
  • PiPiPi 5
    The Unexpected Pi-Fect Deals This March 14
    • March 13, 2025
  • /
  • Technology
  • Tools
  • About
  • Contact Us

Input your search keywords and press Enter.