Introduction In the first part of our series on Data Governance, we looked into what is data governance, why we need it and what are the future trends.  In the second part, we discussed various Data Governance frameworks used by organisations to implement data governance.  In part 3, we will discuss various Data Governance operating models and services […]

INTRODUCTION In the first part of our series on Data Governance, we looked into what is data governance, why we need it and what are the future trends.  In part 2, we will discuss Data Governance frameworks as well as people and processes needed to implement data governance within an organisation.  DATA GOVERNANCE FRAMEWORK Data Governance is […]

INTRODUCTION TO DATA GOVERNANCE DEFINITION Data governance is a set of roles, processes, policies, metrics, and tools that ensures organisational data is secure, confidential, accurate, accessible, and usable. Organisations use data governance to decide how is data within the organization inputted, stored, handled, accessed, and deleted and who has the authority to do so. It also determines who are accountable to ensure compliance with any data-related regulation.  NEED FOR DATA GOVERNANCE As the oft-repeated saying goes: “Data is the new oil.” […]

Introduction The healthcare industry is in the midst of a digital revolution that is fuelled by rise of smart medical devices, learnings from global COVID-19 pandemic and commoditisation of IT hardware. The impact is felt in every aspect of healthcare, from prevention to treatment to recovery. Much of this is driven by the phenomenal growth […]

INTRODUCTION In 2011, Marc Andreessen, developer of Netscape Navigator and founder of Andreessen Horowitz, proclaimed “Software is eating the world”   Jay Kreps, CEO, and co-founder of Confluent, revised that assertion by stating “Every Company is Becoming Software”  So, it comes as no surprise that even a staid sector like banking, financial services, and insurance (BFSI) […]

INTRODUCTION APIs (Application Programming Interfaces) are the codes that allow services and applications to communicate and share information with one another. They speed up business processes by creating unified workflows and removing data silos without bringing additional complexities. They are the driving force behind new digital transformation initiatives and innovative customer experiences. Digital transformation is […]

PySpark is a Python API for Apache Spark to process larger datasets in a distributed cluster. It is written in Python to run a Python application using Apache Spark capabilities PySpark is a Python API for Spark released by the Apache Spark community to support Python with Spark. Using PySpark, one can easily integrate and […]