Getting started using Databricks Community Edition

posted in: Databricks | 1

image

Prerequisite: Signup for Community Edition – see how here

Now that you have your free account we can now get started using Databricks. For community edition make sure you bookmark the url for later.

When you sign-in, if you’re after Community Edition make sure the login looks like this

image

A really good place to start is the Databrick Quickstart with a notebook which can be found here.

Once you’re in your environment should look like this

CommunityEdition1

When I ran this quick start I wanted to use my own csv data (3rd party datasets) from the instructions here.

For this I wanted to use something more local so I choose to use Australian Open Data from here.

image

Once i have my dataset I can create a table from it in databricks. I want to stay on the least cost option so I’m going to upload it to Community Edition.

image

Then select your compute that you would have created when following the quick start. Be aware though on community edition they get stopped if you haven’t used them recently and this error can be a bit confusing at first

image

Pop over to your compute and you’ll most likely find it’s gone to sleep and you’ll want to restart it.

image

Unfortunately you can’t restart with Community Edition so you’ll need to delete it and create a new one. image

Then you can create your CSV File

image

Very similar to many CSV import interfaces to help you set the titles and override them etc

image

Now we can follow the getting started to create a notebook and start querying our new table

image

With all the assistants available now, rather than having to look up documentation I was drawn to the Databricks assistant button

image

This is not available on Community Edition

image