Azure Data Lake Store

Official Documentation

Service Description

Azure Data Lake Store is an enterprise-wide hyper-scale repository for big data analytic workloads. Azure Data Lake enables you to capture data of any size, type, and ingestion speed in one single place for operational and exploratory analytics.

Azure Data Lake Store can be accessed from Hadoop (available with HDInsight cluster) using the WebHDFS-compatible REST APIs. It is specifically designed to enable analytics on the stored data and is tuned for performance for data analytics scenarios. Out of the box, it includes all the enterprise-grade capabilities—security, manageability, scalability, reliability, and availability—essential for real-world enterprise use cases.

Getting Started

  1. Azure Data Lake Store Learning Path
    10/3/2016, Webpage
  2. Introducing Azure Data Lake
    11/21/2016, Mva
  3. Introducing Azure Data Lake
    8/6/2017, Mva
  4. Azure Data Lake, and The Multi-Clustered Shared Storage Architecture
    5/23/2017, Blog
  5. HDInsight on Azure Data Lake Store
    6/24/2016, Video, 0:17:33

Latest Content

Subscribe to News about Azure Data Lake Store

Title  
Blog
Video
Blog
Blog
Blog
Blog
Blog
Video
Video
Blog
Blog
Blog
more...

Azure Documentation

1. Overview
     1.1. Overview of Azure Data Lake Store
     1.2. Compare Azure Data Lake Store with Azure Storage
     1.3. Use Azure Data Lake Store for big data processing
     1.4. Open source applications that work with Azure Data Lake Store
2. Get started
     2.1. Using Portal
     2.2. Using PowerShell
     2.3. Using .NET SDK
     2.4. Using Java SDK
     2.5. Using REST API
     2.6. Using Azure CLI 2.0
     2.7. Using Node.js
     2.8. Using Python
3. How to
     3.1. Copy Data
          3.1.1. Using Azure Data Factory
          3.1.2. Using AdlCopy
          3.1.3. Using DistCp
          3.1.4. Using Sqoop
          3.1.5. Upload data from offline sources
          3.1.6. Migrate Azure Data Lake Store across regions
     3.2. Secure Data
          3.2.1. Security overview
          3.2.2. Access control in Data Lake Store
          3.2.3. Secure data in Data Lake Store
          3.2.4. Service-to-service authentication
          3.2.5. End-user authentication
          3.2.6. Encryption
     3.3. Performance
          3.3.1. Performance tuning guidance for Azure Data Lake Store
          3.3.2. Performance tuning guidance for Spark on HDInsight and Azure Data Lake Store
          3.3.3. Performance tuning guidance for Hive on HDInsight and Azure Data Lake Store
          3.3.4. Performance tuning guidance for MapReduce on HDInsight and Azure Data Lake Store
          3.3.5. Performance tuning guidance for Storm on HDInsight and Azure Data Lake Store
     3.4. Integrate with Azure Services
          3.4.1. Access from VMs in Azure VNET
          3.4.2. Use with Data Lake Analytics
          3.4.3. HDInsight with Data Lake Store - Portal
          3.4.4. HDInsight with Data Lake Store as default storage - PowerShell
          3.4.5. HDInsight with Data Lake Store as additional storage - PowerShell
          3.4.6. HDInsight with Data Lake Store - Azure template
          3.4.7. Use with Data Factory
          3.4.8. Use with Stream Analytics
          3.4.9. Use with Power BI
          3.4.10. Use with Data Catalog
          3.4.11. Use with PolyBase in SQL Data Warehouse
          3.4.12. Use with SQL Server Integration Services
          3.4.13. More Azure integration options
     3.5. Manage
          3.5.1. Access diagnostic logs
          3.5.2. Plan for high availability
4. Reference
     4.1. Code samples
     4.2. PowerShell
     4.3. .NET
     4.4. Java
     4.5. Node.js
     4.6. Python (Account Mgmt.)
     4.7. Python (Filesystem Mgmt.)
     4.8. REST
     4.9. Azure CLI 2.0
5. Resources
     5.1. Azure Roadmap
     5.2. Data Lake Store Blog
     5.3. Give feedback on UserVoice
     5.4. MSDN Forum
     5.5. Pricing
     5.6. Pricing calculator
     5.7. Service updates
     5.8. Stack Overflow Forum
     5.9. Videos

Web Content

Content Type
Azure Data Lake Store Learning Path Webpage

Online Training Content

Date Title
8/6/2017 Introducing Azure Data Lake
5/24/2017 Processing Big Data with Azure Data Lake Analytics
11/21/2016 Introducing Azure Data Lake

Tools

Tool Description
Azure Data Lake Tools for Visual Studio Azure Data Lake Tools for Visual Studio
AdlCopy Tool zum Kopieren von Daten zwischen Azure Blob Storage und Azure Data Lake Store

Videos

Date Title Length
8/1/2017 Connecting On-premises Hadoop to Azure Data Lake Store 0:18:58
5/10/2017 Demystifying Cloud Data Services for an App Developer 0:33:50
5/9/2017 Loading Data into Azure SQL DW using Polybase 0:17:22
3/20/2017 Cloud Tech 10 - 20th March 2017 0:09:57
2/10/2017 Deep Dive of SSIS 2016 + vNext 1:02:17
11/16/2016 Azure Data Lake GA! 0:20:52
9/30/2016 Build your fully managed, petabyte-scale, secure data store with Azure Data Lake Store 1:15:08
8/18/2016 Azure Data Lake: PowerShell, CLI, SDKs, and APIs 0:10:49
7/13/2016 Advancements in Data Technology 0:29:42
6/24/2016 HDInsight on Azure Data Lake Store 0:17:33

Page 1 of 2

StackOverflow

Date Title