article thumbnail

Data governance in the age of generative AI

AWS Big Data

Data governance is a critical building block across all these approaches, and we see two emerging areas of focus. First, many LLM use cases rely on enterprise knowledge that needs to be drawn from unstructured data such as documents, transcripts, and images, in addition to structured data from data warehouses.

article thumbnail

LA Public Defender CIO digitizes to divert people to programs, not prison

CIO Business Intelligence

In total, it took the CIO’s team and agency a little over two years to convert 160 million documents into a transformed, revamped, and people-centric system, built on the Salesforce CRM, that tells their stories and focuses on people outcomes, not case outcomes.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Avoid generative AI malaise to innovate and build business value

CIO Business Intelligence

Capturing the “as-is” state of your environment, you’ll develop topology diagrams and document information on your technical systems. GenAI requires high-quality data. Ensure that data is cleansed, consistent, and centrally stored, ideally in a data lake. Assess your readiness.

Data Lake 134
article thumbnail

Data Mesh 101: How Data Mesh Can Be Used in an Organization

Ontotext

Domain teams should continually monitor for data errors with data validation checks and incorporate data lineage to track usage. Establish and enforce data governance by ensuring all data used is accurate, complete, and compliant with regulations. This calls for additional planning, documentation, and testing.

article thumbnail

Create a modern data platform using the Data Build Tool (dbt) in the AWS Cloud

AWS Big Data

As part of their cloud modernization initiative, they sought to migrate and modernize their legacy data platform. This popular open-source tool for data warehouse transformations won out over other ETL tools for several reasons. The tool also offered desirable out-of-the-box features like data lineage, documentation, and unit testing.

article thumbnail

Data Profiling: What It Is and How to Perfect It

Alation

For any data user in an enterprise today, data profiling is a key tool for resolving data quality issues and building new data solutions. In this blog, we’ll cover the definition of data profiling, top use cases, and share important techniques and best practices for data profiling today.

IT 52
article thumbnail

How gaming companies can use Amazon Redshift Serverless to build scalable analytical applications faster and easier

AWS Big Data

A data hub contains data at multiple levels of granularity and is often not integrated. It differs from a data lake by offering data that is pre-validated and standardized, allowing for simpler consumption by users. Data hubs and data lakes can coexist in an organization, complementing each other.