0% found this document useful (0 votes)
146 views

03 - IBM Watsonx - Data Introduction For Clients

Uploaded by

Ming Le
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
146 views

03 - IBM Watsonx - Data Introduction For Clients

Uploaded by

Ming Le
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 31

IBM watsonx.

data
Scale AI Workloads
For All Your Data, Anywhere
Content by:
Kevin Shen
Product Manager | Data & AI Software
[email protected]

Anson Kokkat
Product Manager | Data & AI Software
[email protected]

Joshua Kim
Program Director | Data & AI Software
[email protected]

Adam Learmonth
Advisory, Learning Content Development
[email protected]

Presenter:
Ahmad Muzaffar Baharudin
Technical Enablement Specialist | Data & AI
[email protected]
Massive early Broad-reaching Critical focus of AI
adoption & deep impact activity & investment

The speed, scope,


80%
Generative AI could Generative AI
raise global GDP by expected to represent
and scale of
Generative AI of enterprises are
working with or planning 7% 30%
impact is to leverage foundation within 10 years of overall market
unprecedented models and adopt by 2025
generative AI

Sources: Statista; Reuters; Goldman Sachs; IBM


Institute for Business Value; Gartner. Scale Zeitgeist:
AI Readiness Report, a survey of more than 1,600
executives and ML practitioners
2
However, leaders are faced with unprecedented
data challenges to scale AI
This environment leads to more cost and complexity
for those who seek to govern data for AI.

There’s more data In more locations In more formats With less quality

Exploding data growth Multiple locations, clouds, Documents, images, video Stale and inconsistent
applications and silos
The aggregate volume of data 80% of time is spent on 82% of enterprises say data
stored is set to grow over 82% of enterprises are data cleaning, integration quality is a barrier on their
250% in the next 5 years. inhibited by data silos. and preparation. data integration projects.

Source: https://www.idc.com/getdoc.jsp?containerId=US49018922) 3
Traditional approaches to addressing these challenges have created more overall
complexity and cost, which has led to the emergence of data lakehouse architectures

Early 2000s

Today, leaders at most large


enterprises manage their data
and workloads using a mix of data
repositories and data stores in
hybrid environments.
The overall cost across all these
repositories remains high.
It’s difficult for leaders to effectively
leverage and govern the data across
multiple environments and use
enterprise data for analytics and AI.

4
1
Enterprise leaders Ability to scale AI while supporting
require a data compliance with lineage and
architecture that can reproducibility of data
provide quick access
to data, centralized
governance and

2
fit-for-purpose use. Real-time analytics and BI that can
connect to existing data in minutes
without expensive duplicating
or moving of data

3
Data sharing and self-service access
for more users and more data while
strengthening governance and security

5
Introducing…

watsonx

6
The platform
for AI and data watsonx.ai watsonx.data watsonx.governance
Build, train, validate, tune and Scale AI workloads, for all Accelerate responsible,
deploy AI models your data, anywhere transparent and explainable AI
workflows

watsonx Fit-for-purpose data store, built on End-to-end toolkit for AI


A next generation enterprise
studio for AI builders to build, an open lakehouse architecture, governance across the entire model
train, validate, tune, and deploy supported by querying, governance lifecycle to accelerate responsible,
Scale and both traditional machine
learning and new generative AI
and open data formats to access
and share data.
transparent, and explainable AI
workflows
accelerate the capabilities powered by
foundation models. It enables
impact of AI with you to build AI applications in a
trusted data. fraction of the time with a
fraction of the data.

7
IBM watsonx Enable fine-tuned models to be
managed through market leading
governance and lifecycle
The platform management capabilities
Leverage foundation
for AI and data models to automate data
search, discovery, and
linking in watsonx.data
Scale and watsonx.governance
accelerate the
impact of AI watsonx.ai
with trusted data.
watsonx.data

Leverage governed enterprise


data in watsonx.data to
seamlessly train or fine-tune
foundation models

1 Prompting

watsonx.data watsonx.ai watsonx.governance


2 Prompt
Tuning Scale a workloads, Train, validate, tune Enable responsible,
for all your data, and deploy AI transparent and
3 Fine-tuning anywhere models explainable AI workflows

4
Training from
scratch

8
Put AI to work with watsonx
Scale and accelerate the impact of AI with trusted data

Leverage foundation models to automate data


search, discovery and linking in watsonx.data

watsonx.ai watsonx.data watsonx.governance


Build, train, validate, Scale AI workloads, for Accelerate responsible, transparent
tune and deploy AI data, anywhere and explainable AI workflows
models

Leverage governed enterprise data in watsonx.data


to seamlessly train or fine-tune foundation models

Enable fine-tuned models to be managed through market-leading


governance and lifecycle management capabilities

9
What IBM offers

IBM approach for AI: Unleash the intelligence in your business


Consulting Center of Excellence for Generative AI and Client Engineering for watsonx System Integrators,
for AI Software, and
SaaS partners

AI products Digital Labor IT Automation Security Sustainability Application


Modernization

AI and data watsonx


platform watsonx.ai
watsonx.data
watsonx.governance

Hybrid cloud Red Hat


platform OpenShift AI
Ansible Lightspeed

Infrastructure zSystems AWS/Azure/Other


for AI Distributed Infrastructure

10
watsonx.data

An open, hybrid, and governed fit-for-


purpose data store optimized to scale all
data, analytics and
AI workloads.

11
watsonx.data
Hybrid Cloud Built-in Governance Open Source

Scale AI workloads, Access all your data Get started in minutes Reduce the cost of
for all your data, through a single point with built-in a data warehouse
anywhere of entry across all governance, security by up to 50%*
clouds and on-premises and automation. through workload
A fit-for-purpose data environments. optimization across
store, based on an open multiple query engines
lakehouse architecture, and storage tiers.
supported by querying,
governance and open
data formats to access
and share data

*When comparing published 2023 list prices normalized for VPC hours of IBM watsonx.data to several major cloud data
warehouse vendors. Savings may vary depending on configurations, workloads and vendors. 12
Access all your data across
hybrid cloud through a
single point of entry
An open data store, based on an
open lakehouse architecture built
for hybrid deployment of your data,
analytics, and AI workloads

13
watsonx.data
Hybrid Cloud Built-in Governance Open Source

Scale AI workloads, Access all your data Get started in minutes Reduce the cost of
for all your data, through a single point with built-in a data warehouse
anywhere of entry across all governance, security by up to 50%*
clouds and on-premises and automation. through workload
A fit-for-purpose data environments. optimization across
store, based on an open multiple query engines
lakehouse architecture, and storage tiers.
supported by querying,
governance and open
data formats to access
and share data

*When comparing published 2023 list prices normalized for VPC hours of IBM watsonx.data to several major cloud data
warehouse vendors. Savings may vary depending on configurations, workloads and vendors. 14
Get started in minutes Connect to your existing analytics data and deploy
fit-for-purpose query engines in minutes

with built-in governance,


security and automation
Accelerate time to trusted analytics and AI
Address enterprise compliance and security using built-in
centralized governance across your data ecosystem

Use foundation models to discover, augment, refine and


visualize watsonx.data data and metadata

15
watsonx.data
Hybrid Cloud Built-in Governance Open Source

Scale AI workloads, Access all your data Get started in minutes Reduce the cost of
for all your data, through a single point with built-in a data warehouse
anywhere of entry across all governance, security by up to 50%*
clouds and on-premises and automation. through workload
A fit-for-purpose data environments. optimization across
store, based on an open multiple query engines
lakehouse architecture, and storage tiers.
supported by querying,
governance and open
data formats to access
and share data

*When comparing published 2023 list prices normalized for VPC hours of IBM watsonx.data to several major cloud data
warehouse vendors. Savings may vary depending on configurations, workloads and vendors. 16
Reduce your data
warehouse costs by
up to 50%* by
optimizing workloads
Optimize workloads from your data
warehouse when you take advantage
of low-cost object storage and
fit-for-purpose query engines

17
Access all your data, quickly and optimize your data architecture with multi-engine
support and hybrid deployment of analytics and AI workloads

1 3
Public cloud

Optimize costly cloud warehouses


Cloud warehouse Cloud data lake Make the most of fit-for-purpose query
engines and compute resources
2

Optimize & access on-premises warehouses


Use low-cost object storage and
fit-for-purpose engines
Hybrid

IBM watsonx.data
3 4

Modernize data lakes


Run existing reporting and enable
new AI workloads without the cost
and complexity of Hadoop
1 2 3 4
On-premises

Deploy across hybrid cloud and multicloud


Seamlessly deploy to both the public cloud
On-premises warehouses On-premises data lake
and to your existing on-premises investment
2 4

Types of workloads
Structured Unstructured
Technology
Proprietary Open

18
The IBM approach to Best-in-class cost Built-in integrations with Deep expertise and
a data lakehouse and performance IBM data repositories capabilities in data
architecture combines optimizations for and data fabric and storage
the best of IBM with compute and storage
the best of open source

Open and vendor- Enables hybrid, The best of open source


agnostic across multicloud deployments
architectural tiers with the Red Hat
OpenShift platform

19
Effortlessly populate with trusted data leveraging best-in-class
data ingestion and observability

Files RDBMS Applications REST/API s Other Cloud


DWH
watsonx.data + IBM DataStage
Easily build EL(T) pipelines with an intuitive visual design
Monitor and detect
Easily ingest data anomalies from cloud-
from any source 1 2 3 based pipelines •
1 Ingest data from any source
I BM Databand
Leverage 60+ native connectors to ingest data into watsonx.data
I BM DataStage
from any type of source, ensuring top performance with built-in
DataStage Canvas engine scalability
Providing visibility in all data flows,
3 from source to target, supporting the
•2 Reduce cost by offloading data from cloud data warehouses
Easy to use low-code/no-code editor with 60+ native
connectors modern data stack
Monitor and
detect data
Offload data from cloud data warehouses to enable shifting workloads
DataStage Engine anomalies like BI, reporting, or data science to fit-for-purpose query engines
during data
DataStage Engine ingestion
DataStage Engine

Built-in scalability to ensure top

watsonx.data + IBM Databand


performance loading data into
watsonx.data

watsonx.data
Continuously detect and resolve data quality incidents
1 3
I ngest data into Continuously

3 Monitor, detect, and resolve data quality incidents
storage layer; monitor and Monitor and improve the health of DataStage, Spark, or Python
use fit-for-
purpose
Metadata store improve health
of Spark/Python
pipeline workloads running on watsonx.data; detect data anomalies
Access control management
watsonx.data pipelines and accelerate issue resolution
query engines
for BI /reporting/
data science

20
Overview of the key components of IBM watsonx.data: Multiple query
engines, open table formats, and built-in enterprise governance

watsonx.data
Your existing
Data warehouse Data lake
ecosystem

Core watsonx.data functionality


Query engines Ecosystem infrastructure

Governance
and metadata Metadata store
Access control management

Data format

Storage

Infrastructure

22
watsonx.data

23
Use cases

Deploy AI/ML Apply real-time Streamline data Share data


at scale analytics and BI engineering responsibly
Build, train, tune, deploy, Combine data from existing Reduce data pipelines, Enable self-service access
and monitor trusted AI and sources with new data in simplify data transformation, for more users to more
ML models for mission- watsonx.data to unlock new, and enrich data for
critical workloads with data faster insights without the cost data while you strengthen
consumption using SQL,
in IBM watsonx.data; and complexity of duplicating Python or an AI-infused security and compliance
strengthen compliance with and moving data across conversational interface. with centralized governance
lineage and reproducibility different environments. and local automated
of data used for AI. policy enforcement.

24
Powered by

Digital advertising platform Ride-hailing, micromobility Social media


rentals, and food delivery
in Europe and Africa

Over 2000 daily reports and Up to 100,000 daily queries 30,000 queries per day with
100s of pipelines on a 7 PB (over 1.5 million queries per 1000 daily active users on
data lake with over 400 month) with over 2000 active a 300 PB data lake
billion records internal users on 2 PB data lake

Ride-hailing, food delivery Internet technology Communications


API technology

Over 500,000 queries per day Over 2 million queries per Over 2700 active internal
with 7000 weekly active users day for business intelligence users running 1 million
on a 50 PB data lake and one-off use cases queries scanning 40 PB
of data per month

25
watsonx.data “We look forward to partnering "We’re excited to see
with IBM to optimize the how watsonx can help us
watsonx.data stack and drive predictive analytics,
contributing to the open- identify fraud, and optimize
is helping companies source community.” our marketing.”
scale their AI workloads.

Das Kamhout Bahaa’ Awartany


VP and Senior Principal Engineer Chief Data Officer
Intel Capital Bank of Jordan

“Customers will benefit “We believe watsonx.data


from a truly open and will help enterprises lower
interoperable hybrid storage costs, optimize
data platform that compute, and ensure
fuels the adoption of AI.” seamless data management.”

Paul Codding Ashish Baghel


EVP of Product Management CEO and Founder
Cloudera NucleusTeq
26
Analysts agree,
IBM is a leader
in the data
management
market

Forrester Wave: Gartner Magic Quadrant Forrester: The Total


Data Management for Cloud Database Economic ImpactTM for
for Analytics Management Solutions IBM Data Management

Sources: Forrester, Gartner 27


What IBM offers

Why IBM?

Open IBM’s AI is based on the


best open technologies available

Trusted IBM’s AI is transparent,


responsible, and governed

Targeted IBM’s AI is designed for enterprise


and targeted at business domains

Empowering IBM’s AI is for value


creators, not just users

28
Getting started

29
Three ways to get started with watsonx.data today
IBM’s investment in partnering with clients

Free trial Client briefing Pilot program


Experience watsonx.data Discussion and custom demonstration Watsonx pilot developed with IBM
and test out core capabilities of IBM’s generative AI watsonx point- AI engineers. Prove watsonx.data
with a free trial. of-view and capabilities. Understand value for the selected use case(s)
how watsonx.data can be leveraged in with a plan for adoption.
any businesses AI strategy.

Try our free trial 2-4 hours 1-4 weeks

30
© 2023 International Business Machines Corporation

Thank you
IBM and the IBM logo are trademarks of IBM
Corporation, registered in many jurisdictions
worldwide. Other product and service names might be
trademarks of IBM or other companies. A current list
of IBM trademarks is available on ibm.com/trademark.

THIS DOCUMENT IS DISTRIBUTED “AS IS” WITHOUT


ANY WARRANTY, EITHER EXPRESS OR IMPLIED. IN
NO EVENT, SHALL IBM BE LIABLE FOR ANY DAMAGE
ARISING FROM THE USE OF THIS INFORMATION,
INCLUDING BUT NOT LIMITED TO, LOSS OF DATA,
BUSINESS INTERRUPTION, LOSS OF PROFIT OR
LOSS OF OPPORTUNITY.

Client examples are presented as illustrations of how


those clients have used IBM products and the results
they may have achieved. Actual performance, cost,
savings or other results in other operating
environments may vary.

Not all offerings are available in every country in which


IBM operates.

IBM’s statements regarding its plans, directions, and


intent are subject to change or withdrawal without
notice at IBM’s sole discretion. Information regarding
potential future products is intended to outline our
general product direction and it should not be relied
on in making a purchasing decision. The information
mentioned regarding potential future products is not
a commitment, promise, or legal obligation to deliver
any material, code or functionality. Information about
potential future products may not be incorporated into
any contract. The development, release, and timing of
any future features or functionality described for our
products remains at our sole discretion.

Red Hat and OpenShift are registered trademarks of


Red Hat, Inc. or its subsidiaries in the United States
and other countries.

31

You might also like