Azure Data Lake Store

Official Documentation

Service Description

Azure Data Lake Store is an enterprise-wide hyper-scale repository for big data analytic workloads. It lets you capture data of any size, type, and ingestion speed in a single place for operational and exploratory analytics.

Azure Data Lake Store can be accessed from Hadoop (available with an HDInsight cluster) using the WebHDFS-compatible REST APIs. It is designed for analytics on the stored data and is tuned for the performance of data analytics scenarios. Out of the box, it includes the enterprise-grade capabilities (security, manageability, scalability, reliability, and availability) that real-world enterprise use cases require.
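
To make the WebHDFS-compatible access above more concrete, here is a minimal sketch of how a request URL for the REST API could be assembled. It assumes the usual `<account>.azuredatalakestore.net/webhdfs/v1/` endpoint layout; the account name `mydatalake`, the folder `/clickstream/2016`, and the helper `webhdfs_url` are hypothetical and exist only for illustration.

```python
from urllib.parse import quote, urlencode


def webhdfs_url(account: str, path: str, op: str, **params: str) -> str:
    """Build a WebHDFS-style URL for a Data Lake Store account.

    Assumption: the account is reachable at
    https://<account>.azuredatalakestore.net/webhdfs/v1/<path>.
    """
    host = f"{account}.azuredatalakestore.net"
    # Percent-encode the path but keep "/" as the folder separator.
    clean_path = quote(path.strip("/"))
    # "op" names the WebHDFS operation, e.g. LISTSTATUS or OPEN.
    query = urlencode({"op": op, **params})
    return f"https://{host}/webhdfs/v1/{clean_path}?{query}"


if __name__ == "__main__":
    # Hypothetical account and folder, used only to show the URL shape.
    print(webhdfs_url("mydatalake", "/clickstream/2016", "LISTSTATUS"))
```

A real call would also need an Azure Active Directory OAuth 2.0 bearer token in the `Authorization` header; the authentication entries under "Secure Data" in the documentation outline below cover how to obtain one.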

Getting Started

  1. Azure Data Lake Store Learning Path
    10/3/2016, Webpage
  2. Introducing Azure Data Lake
    11/21/2016, MVA
  3. Introducing Azure Data Lake Store
    10/27/2015, Video, 0:24:32
  4. HDInsight on Azure Data Lake Store
    6/24/2016, Video, 0:17:33

Azure Documentation

1. Overview
     1.1. Overview of Azure Data Lake Store
     1.2. Compare Azure Data Lake Store with Azure Storage
     1.3. Use Azure Data Lake Store for big data processing
     1.4. Open source applications that work with Azure Data Lake Store
2. Get started
     2.1. Using Portal
     2.2. Using PowerShell
     2.3. Using .NET SDK
     2.4. Using Java SDK
     2.5. Using REST API
     2.6. Using Azure CLI
     2.7. Using Node.js
     2.8. Using Python
3. How to
     3.1. Copy Data
          3.1.1. Using Azure Data Factory
          3.1.2. Using AdlCopy
          3.1.3. Using DistCp
          3.1.4. Using Sqoop
          3.1.5. Upload data from offline sources
          3.1.6. Migrate Azure Data Lake Store across regions
     3.2. Secure Data
          3.2.1. Security overview
          3.2.2. Access control in Data Lake Store
          3.2.3. Secure data in Data Lake Store
          3.2.4. Service-to-service authentication
          3.2.5. End-user authentication
     3.3. Performance
          3.3.1. Performance tuning guidance for Azure Data Lake Store
          3.3.2. Performance tuning guidance for Spark on HDInsight and Azure Data Lake Store
          3.3.3. Performance tuning guidance for Hive on HDInsight and Azure Data Lake Store
          3.3.4. Performance tuning guidance for MapReduce on HDInsight and Azure Data Lake Store
          3.3.5. Performance tuning guidance for Storm on HDInsight and Azure Data Lake Store
     3.4. Integrate with Azure Services
          3.4.1. Access from VMs in Azure VNET
          3.4.2. Use with Data Lake Analytics
          3.4.3. HDInsight with Data Lake Store - Portal
          3.4.4. HDInsight with Data Lake Store as default storage - PowerShell
          3.4.5. HDInsight with Data Lake Store as additional storage - PowerShell
          3.4.6. HDInsight with Data Lake Store - Azure template
          3.4.7. Use with Data Factory
          3.4.8. Use with Stream Analytics
          3.4.9. Use with Power BI
          3.4.10. Use with Data Catalog
          3.4.11. More Azure integration options
     3.5. Manage
          3.5.1. Access diagnostic logs
          3.5.2. Plan for high availability
4. Reference
     4.1. PowerShell
     4.2. .NET
     4.3. Java
     4.4. Node.js
     4.5. Python (Account Mgmt.)
     4.6. Python (Filesystem Mgmt.)
     4.7. REST
5. Resources
     5.1. Service updates
     5.2. Pricing
     5.3. MSDN Forum
     5.4. Stack Overflow Forum
     5.5. Give feedback on UserVoice
     5.6. Data Lake Store Blog
     5.7. Videos

Web Content

Content | Type
Azure Data Lake Store Learning Path | Webpage

Microsoft Virtual Academy (MVA)

Date | Title
11/21/2016 | Introducing Azure Data Lake

Tools

Tool | Description
Azure Data Lake Tools for Visual Studio | Azure Data Lake Tools for Visual Studio
AdlCopy | Tool for copying data between Azure Blob Storage and Azure Data Lake Store

Videos

Date | Title | Length
11/16/2016 | Azure Data Lake GA! | 0:20:52
9/30/2016 | Build your fully managed, petabyte-scale, secure data store with Azure Data Lake Store | 1:15:08
8/18/2016 | Azure Data Lake: PowerShell, CLI, SDKs, and APIs | 0:10:49
7/13/2016 | Advancements in Data Technology | 0:29:42
6/24/2016 | HDInsight on Azure Data Lake Store | 0:17:33
3/30/2016 | Data Integration in the Cloud and Building Data Analytics Pipelines | 0:33:03
10/27/2015 | Introducing Azure Data Lake Store | 0:24:32

Stack Overflow

Date Title