[RETIRED] Creating a Connection to Amazon Athena

athena

(Todd Nemet) #1

The content of this article has been moved into Looker’s technical documentation here.


AWS Athena support
Looker 4.6 Release Notes
(Justin Burnham) #2

Hi,

Are there any plans to support using IAM Roles to connect to Athena instead of having to pass a hardcoded access key and secret key? Our machines running Looker run on EC2 with a role that already gives them access to Athena and it would be nice and safer if Looker could just use that.

Thanks


(Morgan Imel) #3

Hi Justin,

It looks like you already got your answer from chat, but for the benefit of everyone else, this isn’t something we support right now but we’ve passed the feedback along!


(kenneth.vinson) #4

Just added the “Feature Support” section to this article.


(Openbridge) #5

Since Looker has native connections to AWS Athena you may be interested in how best to leverage Athena and Looker. Using a columnar storage format is ket to optimized performance and lowers per query costs. If you are looking to get started with Athena, this may be of interest. It has automated the setup and optimization of data pipelines for Looker users, including converting data to Apache Parquet format.

https://blog.openbridge.com/aws-athena-automated-60-second-setup-zero-administration-and-automatic-optimization-eba474e9897a)

If there is any interest, please let me know.


(brettg) #6

In looker version 5.20, the Athena driver has been upgraded to use the Athena JDBC API 4.2 (v2.0.2). The extra jdbc params setup in the looker connection will need to be updated to conform to the new connection URL construction:

s3_staging_dir=s3://staging-bucket&log_level=DEBUG&log_path=/tmp/athena_debug.log

Would become:

s3_staging_dir=s3://staging-bucket;LogLevel=DEBUG;LogPath=/tmp/athena_debug.log

See here for more details if needed to update custom JDBC configs or other specifics to your connection configuration.


(brettg) #7

Update to the comment above - The athena driver v2.0.5 is used in looker v5.22+. If you have the right policy attached to your athena user, you can add ;UseResultsetStreaming=1 to the end of your additional params in the connection setup to significantly improve the performance of large result set extraction (this parameter is set to 0 by default).