Académique Documents
Professionnel Documents
Culture Documents
By David Loshin
Sponsored by:
2019
a Data-Centric 2 FOREWORD
Organization 3 NUMBER ONE: BE DATA-CENTRIC
Learn to be more data-centric
By David Loshin
4 NUMBER TWO: TRAIN CITIZEN ANALYSTS
Train everyone to be an analyst
10 AFTERWORD
Speeding time to value
T 425.277.9126 © 2018 by TDWI, a division of 1105 Media, Inc. All rights reserved. Reproductions in whole or in
part are prohibited except by written permission. Email requests or feedback to info@tdwi.org.
F 425.687.2842
E info@tdwi.org Product and company names mentioned herein may be trademarks and/or registered trademarks
of their respective companies. Inclusion of a vendor, product, or service in TDWI research does
tdwi.org
not constitute an endorsement by TDWI or its management. Sponsorship of a publication should
not be construed as an endorsement of the sponsor organization or validation of its claims.
FOREWORD
Over the past two decades, there has been a for provision of services (such as streaming
watershed transition in the ways that organizations entertainment), or to connect individuals (such as
consider the value of their enterprise information dating or recruiting companies).
assets. In the past, organizations typically
Recognize, though, that data centricity is not
overlooked that their core business functions
limited to new companies. Any organization can
revolved around information—instead focusing on
become data-centric by taking advantage of some
an application-centric (or business function-centric)
of the emerging trends discussed in this checklist.
approach to system design and implementation,
Here we will review some of the practices adopted
in which data is the byproduct of the operational
by data-centric organizations and suggest ways to
processes.
learn from their experiences.
The result is that these organizations rely on
systems that were solely designed to manage
transaction and operational processing, which
creates significant complexity and challenges when
attempting to introduce and sustain a business
intelligence, reporting, and analytics capability.
The typical mode of system organization for Other organizations, though, are successfully
many older enterprises tends to be application- rethinking how existing (and anticipated)
centric. This means that systems have been business processes can be redesigned around
designed to meet the transactional or operational their information landscape. These data-centric
needs of particular business functions such as organizations concentrate on the value of
marketing, sales, or finance. In these task-oriented information. They flip the context—instead of
environments, there is a system-oriented bias, and developing systems around functional expectations,
the data created and used by these applications is their processes are engineered around the data,
considered a secondary byproduct of accomplishing and reporting and analytics drive innovation.
the intended functional objectives. Data-centric organizations have an awareness of
their data landscape and understand the nature of
ENCOURAGING DATA CENTRICITY the content contained within their data objects.
ACROSS THE ORGANIZATION
Learn to be more data-centric. Identify the
WILL NOT ONLY INTRODUCE key information archetypes that are critical to
OPPORTUNITIES FOR IMPROVEMENT, organizational success (such as “customer,”
IT WILL INSPIRE INNOVATIVE “product,” “vendor,” etc.), and assess how these
information concepts are touched across the range
APPROACHES TO REENGINEERING of business functions.
BUSINESS PROCESS AROUND
Consider the practical aspects of data organization,
INFORMATION.
preparation, and integration for reporting
and analytics to gain critical insights from the
The challenge for these types of organizations is
information asset. Integrate the results of analyses
that despite growing interest in exploiting advanced
with operations to drive continuous optimization
analytics, these companies continue to agonize over
and improvement.
how to leverage their corporate information assets.
Two competing technical trends have established Better yet, the barriers to entry to becoming a
a fertile environment for data innovation. On the citizen analyst are low, and it is beneficial to train
one hand, increased data volume, variety, and everyone in the organization to be an analyst. Key
velocity have increased the complexity of the factors in this training include realizing the value of
data landscape. At the same time, improved tools information, ways to integrate different methods
coupled with increased user sophistication have of analysis, rapid prototyping of reports and visual
created a growing pool of savvy individuals whose presentations, blending analysis with visualization
expertise spans both the business and technical for data storytelling, and facilitating collaboration
sides. among the pool of analysts.
Gartner has defined a “citizen data scientist” as That being said, this training cannot be complete
“a person who creates or generates models that without helping citizen analysts overcome the
use advanced diagnostic analytics or predictive barriers of aging technology. Look for ways to build
and prescriptive capabilities, but whose primary a data stack that simplifies the development of
job function is outside the field of statistics and reporting and analysis platforms without requiring a
analytics.”1 significant background in technology.
We can expand the “citizenry” concept to a wider A data stack that simplifies reporting and analysis
collective of citizen analysts whose mandate is enables individuals to support their jobs without
broader than just building predictive models. being confounded by the traditional information
technology bottlenecks.
CITIZEN ANALYSTS ARE ESSENTIALLY
NONTECHNICAL INDIVIDUALS
POSITIONED AS BUSINESS-ORIENTED
PROBLEM SOLVERS.
1
Idoine, Carly, “Citizen Data Scientists and Why They Matter,” accessed November 5, 2018, from
https://blogs.gartner.com/carlie-idoine/2018/05/13/citizen-data-scientists-and-why-they-matter/
Citizen analysts are savvy enough to manage their The best way to break this logjam is to allow citizen
own end-to-end process for employing analysis analysts to bypass IT when there is no specific need
because they know how to: for IT support.
A byproduct of the emergence of the citizen data assets can be used; allow IT staff to exercise
analyst is the recognition that streamlining an their best judgment when it comes to organizing
analyst’s access to data often conflicts with the information and making data available. Some
types of organizational structures that have been example activities might include:
put into place for managing reporting and business
• DATA ASSET ORGANIZATION: identifying,
intelligence. Typical organizational structures such
classifying, and arranging data assets according
as a business intelligence competency center (BICC)
to those classifications
or other type of IT group intended to facilitate the
creation and delivery of reports and analyses have • DATA SERVICES: providing access methods and
recently become bottlenecks. services for end-user analysis tools to access
corporate data
Other organizations don’t even have a formalized
structure supporting business intelligence— • COMPUTATIONAL SUPPORT: overseeing both
members of the IT/engineering teams act as ad hoc on-premises and cloud-based platforms and
BI consultants to help produce requested reports tools enabling users to perform their analyses
and analyses. and generate reports
Eliminating IT as a bottleneck in the BI/analytics Encouraging IT to release control of the reporting
production process is critical for empowering function to the actual analysts not only creates
the community of citizen analysts. This becomes a more innovative environment, it reduces or
especially true as organizations embrace methods eliminates process bottlenecks and encourages
for providing analysts with self-service data access, intellectual freedom to create actionable knowledge
reporting, and analysis. New tools help automate for competitive value.
the process of configuring environments suited to
the citizen analyst for reporting, BI, and analytics. As
analysts are increasingly recognized as the subject
matter experts for transforming data into actionable
knowledge, the role of IT is bound to diminish.
2
Whitler, Kimberly A., “How is Data-Centricity Impacting the Marketing Organization?” June 2016, Forbes, accessed via
https://www.forbes.com/sites/kimberlywhitler/2016/06/18/how-is-data-centricity-impacting-the-marketing-organization-part-1/#1155f8a772f3
6 CELEBRATE AUTOMATION
If one of the trends is to reduce IT bottlenecks for • Data extraction, transformation, preparation,
reporting and analytics, we must also consider ways and loading (ETL)
to enable IT to retreat somewhat from complete
• Query analysis and query optimization
involvement in the end-to-end processes.
• Data warehouse resource management
When we recognize that much of the data
warehouse production and utilization cycles relies • Management of cloud data warehouse
on standardized structures, templates, and access environments
methods, it should not be a surprise that there
is a growing trend toward automating the whole • Scalable storage management
process of creating a reporting and analytics
• Continuous maintenance
environment, arranging for its population with
data, providing control to the data consumers, and • Cached query materialization
rapidly responding to requests for information.
Embracing these different aspects of data warehouse
SIMPLIFY THE WAY THAT THE automation helps to simplify the materialization of
analytical systems as well as speed the ability to
DATA STACK IS CONFIGURED AND deliver results.
INSTANTIATED AND DO SO USING
DATA WAREHOUSE AUTOMATION
TECHNOLOGY.
• Data connectivity
• Data ingestion
Another trend that has been growing over the past FOLLOW THE TREND AND ADOPT
decade is the transition from waterfall methods
A STRATEGY OF AGILE ANALYTICS.
of system design and development to the agile
methodology, and we can adapt the agile principles LOOK FOR PRODUCTS THAT SIMPLIFY
to developing reports and analyses. Some key THE PROCESS OF ADDING DATA
principles of the agile approach include: SOURCES TO THE ENVIRONMENT,
• Early and continuous delivery of a minimum FUSING THOSE DATA SETS INTO THE
viable product that serves the customers’ needs DATA WAREHOUSE, AND MAKING
• Adapting to changing requirements, even late THOSE DATA SETS ACCESSIBLE
in the development process VIA END-USER ANALYSIS AND
• Frequent delivery of products that can be VISUALIZATION PRODUCTS.
reviewed and used by the customer Automated methods for assessing metadata from
data sources help optimize storage orientation
• Continued iterative refinement and improvement
and query performance, thereby improving
sustained over an indefinite time period
data accessibility. In essence, this allows your
• Gaining synergy from business and technical organization to quickly embrace a wide variety of
people working together data sources and allow immediate accessibility for
analytical purposes.
• Maintaining simplicity and avoiding
unnecessary work3
3
“12 Principles Behind the Agile Manifesto,” Agile Alliance, accessed November 30, 2018, via
https://www.agilealliance.org/agile101/12-principles-behind-the-agile-manifesto/
• Leveraging automation