Navigation

Deploy a Data Lake

Beta

The Atlas Data Lake is available as a Beta feature. The product and the corresponding documentation may change at any time during the Beta stage. For support, see Atlas Support.

Estimated completion time: 15 minutes

This part of the tutorial will guide you through deploying an Atlas Data Lake.

Prerequisites

To complete this part of the tutorial, you will need to:

Procedure

1

Log in to MongoDB Atlas.

2

Select the Data Lake option on the left-hand navigation.

3

Click the Configure a New Data Lake.

4

Review the Overview, then click the green Configure a New Data Lake button.

Screenshot of the Data Lake Overview.
5

Enter the name for your Data Lake as you want it to appear in Atlas and click Next.

Screenshot of naming the Data Lake.
6

Enter the name of your S3 bucket then click Save & Continue.

Important

The bucket name must match the name specified in Part 1: Prepare Your S3 Bucket.

Screenshot of the Data Lake S3 bucket configuration.
7

Create an IAM user for Atlas and assign the required policy.

Follow the steps in the Atlas user interface to create a role and policy, then assign it to Atlas.

Important

Atlas displays the External ID and the Atlas AWS IAM user ARN for a Data Lake only once. You must save these values to a secure location to reconfigure your Data Lake in the future.

8

Validate your Data Lake configuration with the role ARN and bucket name.

Enter the role ARN and the bucket name then click Validate & Launch.

Screenshot of validating the Data Lake with the role ARN and bucket name.

To obtain the role ARN from the AWS console:

  1. Log in to the AWS Console.
  2. Click the Services dropdown menu on the upper left-hand side of the console.
  3. Under Security, Identity, & Compliance, select IAM.
  4. Select Roles from the left-hand navigation.
  5. Click the name of your newly-created role from the table.
  6. Copy the value next to the Role ARN label.

Next Steps

Now that your Data Lake is deployed, proceed to Connect to Your Data Lake.

Screenshot of the deployed Data Lake at this point.