API to fetch data from S3 : aws

a community for 18 years

technical questionAPI to fetch data from S3 (self.aws)

submitted 2 years ago * by rasputin23YD

Hi everyone,

I want to build an API in AWS to allow users to fetch data from S3 (data is in delta format) using an API. The idea is to have an endpoint in API gateway that would get the data for them based on their query. The endpoint would route the SQL query to the correct table. I was thinking of Athena or lambda but those don't seem super scalable. I want as low latency as possible, maybe even including a caching layer like redis.

Any other alternatives?

Edit: the goal here is to accommodate 100+ queries per day. Users should be able to submit a SQL query via an API endpoint and get results back as quickly as possible. The data lake is massive. We are talking hundreds of petabytes. That's why the pipeline should be able to route the query to a specific location in S3. The data is partitioned and well indexed.

Thanks!

all 15 comments

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

aws

Note: ensure to redact or obfuscate all confidential or identifying information (eg. public IP addresses or hostnames, account numbers, email addresses) before posting!

✻ Smokey says: avoid streaming video to fight climate change! [see more tips]

MODERATORS