Building an efficient request-counting function

Tuesday, 11/04/2023

Tram Ho

Problem

The task is to track the number of requests sent to the system so that the backend can display detailed counts per day, month, or year.
Because the system receives a very large number of requests, up to millions per day, three things need to be solved: counting the requests cheaply, designing a database for long-term storage, and querying it efficiently.

General idea

1. Saving the number of requests

1.1 Ideas

Let's start with the idea everyone thinks of first: for each request received, we send one command to the db that increments the request count by 1.

Problem:

Every request has to wait for an extra call to the db, which slows down the response.
A large number of requests arriving at the same time increases the db load, which can affect the processing speed of the entire service.

1.2 Improvement

How can we increment the request count quickly while minimizing the number of commands sent to the db?
We need to tweak the architecture a bit.

Instead of incrementing the count in the db, we increment it in Redis, because Redis keeps data in memory, so its operations are very fast and it can handle a large number of requests at the same time.
Redis holds the total number of requests for the current day, and once a day a service reads that total from Redis and writes it to the db.
=> This avoids calling the db on every request.
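
The flow above can be sketched as follows. This is a minimal illustration, assuming Python and using an in-memory `Counter` as a stand-in for Redis so that the example runs on its own. The key format `req:<partner_id>:<date>` and the names `record_request` and `flush_day` are assumptions made for the example, not part of the original design. With a real Redis client, the increment would be a single atomic `INCR` on the same key.

```python
from collections import Counter
from datetime import date

# Stand-in for Redis: maps a counter key to its current value.
# Assumption: in production this would be a Redis INCR on the same key.
fake_redis = Counter()

# Stand-in for the database: one stored total per (partner_id, day).
fake_db = {}


def counter_key(partner_id: int, day: date) -> str:
    """Build the (hypothetical) counter key for one partner and one day."""
    return f"req:{partner_id}:{day.isoformat()}"


def record_request(partner_id: int, day: date) -> None:
    """Called on every request: one cheap increment, no db call."""
    fake_redis[counter_key(partner_id, day)] += 1


def flush_day(partner_id: int, day: date) -> int:
    """Called once a day by the sync service: move the total into the db."""
    total = fake_redis.pop(counter_key(partner_id, day), 0)
    db_key = (partner_id, day.isoformat())
    fake_db[db_key] = fake_db.get(db_key, 0) + total
    return total
```

The request path only touches the counter, and the db sees one write per partner per day, no matter how many requests arrived.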

2. Efficient handling of long-term storage & querying

2.1 Ideas

Because we need to track the number of requests in detail, down to the hour, we need to store the number of requests per hour in the db.
I use MongoDB, so we can store one document per hour like this:

{
   partner_id: 12345,
   timestamp: ISODate("2023-03-01T01:00:00.000Z"),
   count: 40
}
{
   partner_id: 12345,
   timestamp: ISODate("2023-03-01T02:00:00.000Z"),
   count: 100
}
.....
{
   partner_id: 12345,
   timestamp: ISODate("2023-03-01T23:00:00.000Z"),
   count: 20
}

Problem:

Storing the data this way forces us to aggregate it ourselves every time: by day, then by month, then by year.
Querying a large number of records and then aggregating them makes lookups slow.
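
To make the cost concrete, here is a small sketch, assuming Python and plain dictionaries in place of MongoDB documents. Getting one daily total means reading and summing up to 24 hourly records per partner, and a monthly total needs roughly 30 times as many. The function name `daily_totals` is an assumption made for the example.

```python
from collections import defaultdict
from datetime import datetime


def daily_totals(hourly_docs):
    """Sum hourly records into one total per (partner_id, day)."""
    totals = defaultdict(int)
    for doc in hourly_docs:
        day = doc["timestamp"].date().isoformat()
        totals[(doc["partner_id"], day)] += doc["count"]
    return dict(totals)


# Three of the hourly records from the sample above.
hourly_docs = [
    {"partner_id": 12345, "timestamp": datetime(2023, 3, 1, 1), "count": 40},
    {"partner_id": 12345, "timestamp": datetime(2023, 3, 1, 2), "count": 100},
    {"partner_id": 12345, "timestamp": datetime(2023, 3, 1, 23), "count": 20},
]

print(daily_totals(hourly_docs))  # every hourly record is read to get one number
```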

2.2 Improvement

1. We can apply the Bucket Pattern: group data of the same type into a single record, which saves space and makes queries simpler.
For example, all the hourly request totals for one day can be stored in a single record:

{
   partner_id: 12345,
   date: "2023-03-01",
   counts: {
      1: 40,
      2: 100,
      .....,
      23: 20
   },
   total_count: 2000
}
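
A minimal sketch of how such a daily bucket could be filled, assuming Python and an in-memory dictionary in place of the MongoDB collection. The name `add_hourly_count` is an assumption made for the example. In MongoDB, the same effect would typically come from an upsert that applies `$inc` to `counts.<hour>` and `total_count`.

```python
from datetime import datetime

# Stand-in for the MongoDB collection, keyed by (partner_id, date).
daily_buckets = {}


def add_hourly_count(partner_id: int, hour_start: datetime, count: int) -> dict:
    """Add one hour's count to that day's bucket, creating the bucket if needed."""
    day = hour_start.date().isoformat()
    bucket = daily_buckets.setdefault(
        (partner_id, day),
        {"partner_id": partner_id, "date": day, "counts": {}, "total_count": 0},
    )
    hour = str(hour_start.hour)  # field names in a MongoDB document are strings
    bucket["counts"][hour] = bucket["counts"].get(hour, 0) + count
    bucket["total_count"] += count
    return bucket
```

Reading the total for a day is then a single lookup of one record, with no aggregation at query time.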

With this layout, we can easily query information by day. So what about by month and by year?
Similarly, we can store monthly information as follows:

{
  partner_id: 12345,
  date: "2023-03",
  counts: {
     1: 2000,
     2: 2023,
     .......,
     31: 3100
  },
  total_count: 100000
}

=> So we can query by day, month, and year simply and efficiently.

2. To keep the data above available for queries, the synchronization step has to update 3 records (day, month, year) instead of only 1 record as before.
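
As a rough sketch of that synchronization step, assuming Python: the hypothetical helper `bucket_updates` below builds one filter and one `$inc` update for each of the three buckets. The yearly layout (`date: "2023"`, with `counts` keyed by month) is an assumption that mirrors the daily and monthly records shown earlier, since the yearly record is not shown. With a MongoDB driver, each pair could be applied as an upsert.

```python
from datetime import datetime


def bucket_updates(partner_id: int, hour_start: datetime, count: int):
    """Return (filter, update) pairs for the day, month and year buckets."""
    # Each bucket is keyed by the next-smaller unit:
    # day bucket by hour, month bucket by day of month, year bucket by month.
    plan = [
        (hour_start.strftime("%Y-%m-%d"), hour_start.hour),
        (hour_start.strftime("%Y-%m"), hour_start.day),
        (hour_start.strftime("%Y"), hour_start.month),  # assumed yearly layout
    ]
    return [
        (
            {"partner_id": partner_id, "date": date_key},
            {"$inc": {f"counts.{slot}": count, "total_count": count}},
        )
        for date_key, slot in plan
    ]


for flt, upd in bucket_updates(12345, datetime(2023, 3, 1, 2), 100):
    print(flt, upd)
```

Because every update is an increment, the three records stay consistent with each other as long as each synced count is applied to all three.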

Other problems

1. How should information be saved in redis?

2. Handling syncing after every day
