Vous êtes sur la page 1sur 5

Microsoft Big Data

Solution Sheet

This solution sheet is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS SUMMARY. 2011 Microsoft Corporation

CONTENTS
Introduction ......... . .............................3

Microsoft Big Data Solution . . . . . . . . .. . . . . . . . . . . . . . . . . . . . . . 4 Broadening Access to Hadoop. . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 Enterprise ready Hadoop. . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . 5 Breakthrough Insights . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . 5

This solution sheet is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS SUMMARY. 2011 Microsoft Corporation

Key customer Challenges:


Data explosion, driven by declining hardware cost and new data sources Greater variety of data - customers need to analyze both relational and non-relational data Over 80% of data captured is unstructured

Introduction
Todays organizations face growing challenges extracting business value from their data: First, the relentless growth of data continues, due to the proliferation of new devices and sensors, and rapidly declining hardware cost. More organizations now store terabytes and even petabytes of data. Second, data complexity is increasing as customers store both structured data in relational format and unstructured data such as Word or PDF files, images, videos and geo-spatial data. Indeed industry analysts confirm that over 80% of data captured is in unstructured format. Finally customers are also challenged by the velocity of data organizations that process streaming data such as click streams from web sites, need to update data in real time to serve the right advert or present the right offers to their customers. Microsoft has been doing Big Data long before it was megatrend in the market: At Bing we analyze over 100 petabytes of data to deliver high quality search results. More broadly, Microsoft provides a range of solutions to help customers address big data challenges. Our family of data warehouse solutions from Microsoft SQL Server 2008 R2, SQL Server Fast Track Data Warehouse, Business Data Warehouse and SQL Server 2008 R2 Parallel Data Warehouse offer a robust and scalable platform for storing and analyzing data in a traditional data warehouse. Parallel Data Warehouse (PDW) offers customers: Enterprise-class performance that handles massive volumes to over 600 TB. We also provide LINQ to HPC (High Performance Computing) a distributed runtime and a programming model for technical computing. In addition to our traditional capabilities mentioned above, Microsoft is embracing Apache HadoopTM as part of an end to end roadmap to deliver on our vision of providing business insights to all users by activating new types of data of any size.

Increased velocity of data requiring organizations to respond quickly to rapidly changing data The need to explore data interactively with few preconceived questions

This solution sheet is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS SUMMARY. 2011 Microsoft Corporation

Microsoft Big Data Solution


Microsofts vision is to provide business insights to all users from any data, including insights previously hidden in unstructured data. To achieve this goal Microsoft will ship an Apache HadoopTM based distribution for Windows Server and Windows Azure to help accelerate its adoption in the Enterprise. This new Hadoop based distribution from Microsoft enables customers to derive business insights on structured and unstructured data of any size and activate new types of data. Rich insights from Hadoop can be combined seamlessly with the Microsoft Business Intelligence Platform.

Broadening Access to Hadoop


Microsoft is committed to broadening the accessibility and usage of Hadoop to users, developers and IT professionals. The new Hadoop based distribution for Windows offers IT professionals ease of use by simplifying the acquisition, installation and configuration experience. Thanks to smart packaging of Hadoop and its toolset, customers can install and deploy Hadoop in hours instead of days. End users can use the Hive ODBC Driver or Hive Add-in for Excel to analyze data from Hadoop using familiar tools such as Microsoft Excel and award winning BI clients such as PowerPivot for Excel.

Key Benefits
Broader access of Hadoop to end users, IT professionals and Developers, through easy installation and configuration and simplified programming with JavaScript Enterprise-ready Hadoop distribution with greater security, performance and ease of management Breakthrough insights through the use of familiar tools such as PowerPivot for Excel, SQL Server Analysis and Reporting Services

Our Big Data solution also offers interoperability with other Hadoop distributions, enabling customers to derive insights from several sources. Two Hadoop Connectors: First, we offer 2 Hadoop connectors that enable customers to move data seamlessly between Hadoop and SQL Server or SQL Server Parallel Data Warehouse. These 2 Hadoop connectors are now available to existing customers. Hive ODBC Driver, plus Excel Hive AddIn: Second, we offer a new Hive ODBC Driver and an Excel Hive Add-in that enable customers to move data from Hive directly into Excel, or Microsoft BI tools such as PowerPivot, for analysis.

Outline of Microsoft's Big Data Solution For developers, Microsoft is investing to make JavaScript a first class language within Big Data by making it possible to write high performance Map/Reduce jobs using JavaScript. In addition, our JavaScript console will allow users to write JavaScript Map/Reduce jobs, Pig-Latin, and Hive queries from the browser to execute their Hadoop jobs. This is the sort of innovation that Microsoft hopes to contribute back as proposals to the community.

This solution sheet is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS SUMMARY. 2011 Microsoft Corporation

Enterprise ready Hadoop


To accelerate its adoption in the Enterprise, Microsoft will make Hadoop Enterprise ready by Active Directory Integration: Providing Enterprise-class security through integration of Hadoop with Active Directory High Performance: Boosting Hadoop performance to offer consistently high data throughput System Center Integration: Simplifying management of the Hadoop infrastructure through integration with Microsofts management tools such as System Center BI Integration: Enabling integration of relational and Hadoop data into Enterprise BI solution with Hadoop connectors Flexibility and Choice with deployment options for Windows Server and Windows Azure which offers customers:
o Freedom to choose: More control as they can choose which data to keep in-house instead of the cloud. Lower TCO: Cost saving, as fewer resources are required to run their Hadoop deployment in the cloud

Breakthrough Insights
Microsofts Big Data solution offers breakthrough insights by enabling customers to combine the richness of relational data from databases with unstructured data from Hadoop. Our Hadoop based distribution for Windows Server and Windows Azure enables customers to: Analyze Hadoop data with familiar tools such as Excel, thanks to a Hive Add-in for Excel Reduce time to solution through integration of Hive and Microsoft BI tools such as PowerPivot and Power View Build corporate BI solutions that include Hadoop data, through integration of Hive and leading BI tools such as SQL Server Analysis Services and Reporting Services

The Hive ODBC driver allows customers to move data from Hive directly into either Microsoft Excel or SQL Server BI tools such as SQL Server Analysis Services, Reporting Services, PowerPivot and Power View for rich data visualization. These insights can be incorporated into dashboards for consumption by decision makers and stakeholders.

Elasticity to meet demand: Elasticity reduces your costs, since more nodes can be added to the Windows Azure deployment for more demanding workloads. In addition, the Azure deployment of Hadoop can be used to extend the on premise solution in periods of high demand
Increased Performance: Bringing computing closer to the data our solution enables customers to process data closer to where data is born, whether on premise or in the cloud

As mentioned earlier, our broader goal is to make Hadoop accessible to a broader class of developers, IT professionals and end users, by providing enterprise class Hadoop based distributions on Windows and by enabling all users to derive breakthrough insights from any data.

Additional Information
For more information on Microsofts Big Data solution, go to www.microsoft.com/bigdata Download the Hadoop connector for SQL Server from www.microsoft.com/download/en/details.aspx?id =27194

We do this while maintaining compatibility with existing Hadoop tools such as Pig, Hive, and Java. Our goal is to ensure that applications built on Apache Hadoop can be easily migrated to our distribution to run on Windows Azure or Windows Server.

This solution sheet is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS SUMMARY. 2011 Microsoft Corporation