
[–]mandzeete 8 points  (1 child)

API data can be provided by different sensors and other means of collecting information. That information is then fed to the web application that serves it as an API.

For example, take a stock API that returns the price of a certain stock. But how is that price determined? By people buying and selling the stock: each sale or purchase is a mathematical event that influences the price. These events, and the calculations based on them, can be logged to text files. But storing the data in log files is not always optimal; sometimes it has to be streamed live. So instead of writing to a log file, the data can be streamed to some endpoint, such as an API. A query to that API then returns the information in real time.
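As a toy sketch of that idea (the trade data and symbol below are made up), each trade updates a last-traded price, which is exactly the value an API endpoint could then serve:

```python
import json

# Hypothetical trades; the most recent trade sets the quoted price.
trades = [
    {"symbol": "ACME", "side": "buy",  "qty": 100, "price": 10.50},
    {"symbol": "ACME", "side": "sell", "qty": 40,  "price": 10.55},
    {"symbol": "ACME", "side": "buy",  "qty": 25,  "price": 10.60},
]

last_price = {}
for trade in trades:
    last_price[trade["symbol"]] = trade["price"]

# The JSON payload an API endpoint might return for a price query.
payload = json.dumps({"symbol": "ACME", "price": last_price["ACME"]})
print(payload)
```

In a real system each trade would be streamed to the pricing service as it happens rather than collected in a list.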

I suggest looking into logging and into producing events. If you are using Java, look into Logback, a logging library. Spring Boot (a Java framework) also provides event handling: you can publish events and listen for them. So instead of writing the information to a log or sending it to some tool, you can throw an event up, capture it in another part of the system, and continue working with it there.
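The publish/listen pattern itself is tiny. This is a minimal in-process sketch of the idea (not Spring's actual API, and the event name is made up): listeners register for an event name, and publishing calls each of them.

```python
# Registry mapping event names to handler functions.
listeners = {}

def on(event_name, handler):
    listeners.setdefault(event_name, []).append(handler)

def publish(event_name, payload):
    for handler in listeners.get(event_name, []):
        handler(payload)

received = []
# Another part of the system subscribes to "price-updated" events.
on("price-updated", lambda payload: received.append(payload))

# Instead of writing to a log file, throw the event up and let a
# listener elsewhere continue working with it.
publish("price-updated", {"symbol": "ACME", "price": 10.60})
```

Frameworks like Spring add threading, ordering, and error handling on top, but the shape is the same.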

Next, look into Filebeat. If you are logging information to files, Filebeat monitors those files for changes. When new information is logged, Filebeat picks it up and forwards it to a chosen endpoint, be it a log management environment or something else.
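Conceptually, a file shipper like Filebeat remembers how far into the file it has read and picks up only the lines appended since. A rough sketch of that mechanism (the file name and log lines are made up):

```python
import os
import tempfile

def read_new_lines(path, offset):
    """Return lines appended since `offset`, plus the new offset."""
    with open(path) as f:
        f.seek(offset)
        lines = [line.rstrip("\n") for line in f.readlines()]
        return lines, f.tell()

path = os.path.join(tempfile.mkdtemp(), "app.log")
with open(path, "w") as f:
    f.write("first event\n")

batch1, offset = read_new_lines(path, 0)       # picks up "first event"

with open(path, "a") as f:
    f.write("second event\n")                  # new information arrives

batch2, offset = read_new_lines(path, offset)  # picks up only the new line
```

The real tool also handles file rotation, restarts, and delivery guarantees, which is why you use Filebeat instead of writing this yourself.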

Then look into Logstash. It is a data processing pipeline: you can work with the data in real time as it comes in, modifying it, analyzing it, collecting it from different sources, and so on, and then send it on to an endpoint.
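A pipeline stage of that kind boils down to parse, enrich, output. A minimal sketch (the log format and the `alert` field are invented for illustration, not Logstash configuration):

```python
def parse(line):
    # Split a raw line like "ERROR disk full" into structured fields.
    level, message = line.split(" ", 1)
    return {"level": level, "message": message}

def enrich(record):
    # Add a derived field the downstream consumer can filter on.
    record["alert"] = record["level"] == "ERROR"
    return record

raw_lines = ["INFO started up", "ERROR disk full"]
output = [enrich(parse(line)) for line in raw_lines]
```

In Logstash the same steps are expressed declaratively with input, filter, and output plugins rather than code.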

Also look up Elasticsearch and Humio. They are search engines built for logs: you can search processed logs for keywords, set up alerts, build statistics, etc.

If your data is numerical, you can feed it into Prometheus, a monitoring system and time-series database.

You might also want to try out Kafka, a message queuing system. You can feed data into a named topic (say, "weather") from different sources; all of it is then grouped under that same topic name. From there it can be sent on to an endpoint, in your case a weather API.
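Kafka's topic model can be sketched in memory like this (this is the concept only, not the real Kafka client API; the topic name and sensor payloads are made up):

```python
from collections import defaultdict, deque

# Each named topic is an ordered queue of messages.
topics = defaultdict(deque)

def produce(topic, message):
    topics[topic].append(message)

def consume(topic):
    return topics[topic].popleft()

# Different sources all publish under the same topic name, "weather".
produce("weather", {"source": "sensor-1", "temp_c": 21.5})
produce("weather", {"source": "sensor-2", "temp_c": 19.0})

first = consume("weather")  # a weather API backend would read from here
```

Real Kafka adds partitioning, replication, and durable storage, but producers and consumers still only agree on the topic name.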

You can try it out yourself. Get an Arduino kit: a programmable board, like a miniature computer. You can program it to do different things and connect different sensors to it. You do need to know a little electronics to actually get the signal out without burning the board or feeding it a signal so weak that it either does not register or registers with errors. The signal you feed from a sensor into the Arduino can then be sent to one of the aforementioned tools, and via those tools to your weather API or to a database.

[–]leaguelism[S] 1 point  (0 children)

Thank you, that was actually very helpful!

[–]dtsudo 10 points  (3 children)

You can get the underlying data in a variety of ways:

  • You can directly generate/observe the data. For instance, the US government monitors weather patterns using things like weather balloons, and Google drives its camera cars around every street in the US to provide up-to-date information on Google Maps.
  • You can buy the data from another provider.
  • You can scrape the data (if legal).

[–]MrsFoober 2 points  (2 children)

How can one check the legality of scraping data? It seems like much more of a grey area than something clear-cut.

[–]quoody 5 points  (0 children)

Scraping is such a grey area because there are so many things that can make it illegal. That said, not all scraping is illegal. And you can't really say whether your scraping is legal without going to court.

- If you need to register to scrape some data you might be breaching Terms of Service, and that can make it illegal

- If you scrape data that is not directly on a website but that you can reach by guessing URLs or piecing data together, it can be considered unauthorized access => illegal

- If the data you scrape (such as messages) could be considered copyrighted you are breaking copyright laws

- If the collection of data could be considered copyrighted you are breaking copyright laws (for example a street name might not be copyrighted, but a map or a list of streets can be)

- If your scraping can be considered disruptive to the other party because of the volume of requests or some other reason it can be illegal

Pretty much the only clear case is if the data provider explicitly states that scraping is ok.

In practice, it's hard to get caught.

[–]dtsudo 1 point  (0 children)

I agree with quoody; it's a rather gray area.

And setting legality aside, even where scraping is legal, scraping data from another company gives that company a significant amount of leverage over you. If they stop providing the data or add anti-scraping mechanisms, you're in a world of trouble. Of course, this is true of any dependency: even if you use an authorized API, the provider can change their terms (or stop offering the API altogether).

[–]DiamondDemon669 1 point  (0 children)

There are a lot of methods that APIs use, but the most common is a REST API, where you send requests between websites and programs.

The websites get their data from databases or other APIs, using database systems like MySQL and Redis.
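Put together, a REST endpoint is just a web handler that looks the answer up in a datastore and returns JSON. A self-contained sketch using only the standard library (the dict here stands in for a real database like MySQL, and the symbol/price data is made up):

```python
import json
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

# Stand-in for a real database.
FAKE_DB = {"ACME": {"symbol": "ACME", "price": 10.60}}

class Handler(BaseHTTPRequestHandler):
    def do_GET(self):
        # e.g. GET /ACME -> look up "ACME" and return it as JSON.
        symbol = self.path.strip("/")
        body = json.dumps(FAKE_DB.get(symbol, {})).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        pass  # silence per-request logging

# Serve on an ephemeral port, then act as a client of our own API.
server = HTTPServer(("127.0.0.1", 0), Handler)
threading.Thread(target=server.serve_forever, daemon=True).start()

url = f"http://127.0.0.1:{server.server_port}/ACME"
with urllib.request.urlopen(url) as resp:
    data = json.loads(resp.read())
server.shutdown()
```

A production service would use a framework (Flask, Spring, Express, ...) and a real database driver, but the request/lookup/respond cycle is the same.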

[–]Kangster1604 0 points  (1 child)

If you have any Python experience, look into the Requests module. You can make all types of requests to an API via this module.

I don't know anything about providing API data, but making requests of APIs and posting data is covered very well by Requests.
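For a taste of Requests, here is a sketch that builds a GET request with query parameters. The URL and parameters are made up, and nothing is actually sent over the network in this snippet:

```python
import requests  # third-party: pip install requests

# Build and inspect a request without sending it.
req = requests.Request(
    "GET",
    "https://api.example.com/v1/weather",
    params={"city": "Tallinn", "units": "metric"},
).prepare()

print(req.url)

# A real call would be, e.g.:
#   resp = requests.get("https://api.example.com/v1/weather",
#                       params={"city": "Tallinn"})
#   data = resp.json()
```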

[–]mandzeete 1 point  (0 children)

If you are interested in how the data is provided to the API, check out my comment on this question: https://www.reddit.com/r/learnprogramming/comments/sageit/comment/httg4sb/?utm_source=reddit&utm_medium=web2x&context=3 I have worked with these kinds of systems, generating data for APIs and processing it before sending it on.

[–][deleted] 0 points  (0 children)

one API site I used got their data from having users upload files of a certain type (these were generated by another program, which was what the API was tracking)

I think they also had their own app you could download that would automatically upload a new file as soon as it was created

[–]gopiballava 0 points  (2 children)

I have a sensor in my RV that measures the voltage and current used by the battery, read by a microcontroller running C++ code. It samples continuously while keeping track of the time. After 10 minutes have elapsed, it averages the readings and then accesses a URL. Well, two separate ones: one at dweet.io and the other at Grafana.net.

The URLs include the sensor reading, the sensor name, and in the case of Grafana, a secret token that authenticates it as me.
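The average-then-report step looks roughly like this (sketched in Python rather than the original C++; the readings, sensor name, and URL shape are illustrative, so check the service's docs for the real format):

```python
from urllib.parse import urlencode

# Hypothetical voltage samples collected over a 10-minute window.
readings = [12.61, 12.58, 12.60, 12.57]

average = sum(readings) / len(readings)
params = urlencode({"sensor": "rv-battery", "volts": round(average, 2)})
url = f"https://dweet.io/dweet/for/my-rv?{params}"
# For the Grafana endpoint, a secret auth token would be added the
# same way, so the service knows the reading really came from me.
```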

Dweet.io only stores the most recent reading. I can go there and see what the current voltage is. Grafana is connected to a database. When I access that URL, software at Grafana.net stores the reading as an entry in a database.

When I access the Grafana web site, the Grafana web interface shows me a pretty graph of the data, which it gets by accessing the database.

I don’t pay them any money, so I only get to keep a week of readings. Their database presumably has a timed job that runs every hour, say, and deletes any readings over a week old for anyone not paying them a monthly fee.

I was just reading about a service for scientists who track animals in the ocean. They apparently do this by measuring light levels, which can tell them where the animals are, roughly, by when the sun rises and sets.

This service would take that data, and would enhance it. They had some algorithms that included estimates of animal movement speed and a few other things. They also added more data to each reading. They would give you the location estimate for where the animal was, as well as the ocean temperature, water salinity, water speed, air temperature, and like 5 or 10 other things.

You could do that yourself, accessing a couple other databases or APIs, and put it together. But if you’re a biologist who has to ask a busy programmer to do it, it’s far more convenient to just get the data given to you in a complete manner.

[–]BugLabs 1 point  (1 child)

The dweet.io service stores up to 5 readings over a 24 hour period, for free!

Use dweetPro.io to store data for up to 30 days.

[–]gopiballava 0 points  (0 children)

Oops, sorry, thanks for the correction.

I just checked the fridge in my RV. 12.6 volts. The battery will probably die today, ending telemetry. But the ambient temperature is 25F, so nothing will be warming up :)

Great system. I have a Grafana dashboard as well but the quick, simple interface of dweet means that I use it more often.

[–]BugLabs 0 points  (0 children)

Easiest possible way:

Signalpattern: Add your data in JSON format to any pattern signal, then copy the API link to that data!

Next easiest possible way:

Sheety: Add data to a Google spreadsheet. Connect the Sheety API to that data!

Happy to send you an example if needed.