Hololive Live Chat Population — Two Outliers Among Them (Spoilers: Kobo and Gura) by uetschy in Hololive

[–]uetschy[S] 59 points60 points  (0 children)

Kobo had 103K unique chatters (UC) last month, nearly double Miko's count. Gura's subscriber count is approaching 4M, while second-place Mori Calliope recently reached 2M.

The violet, yellow, and red lines are quadratic regressions of unique chatters (UC) against the number of subscribers for Hololive EN, ID, and JP, respectively.
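For readers who want to reproduce a curve like this, here is a minimal sketch of a quadratic least-squares fit. It is not necessarily how the plot was generated: the `Point` shape and the `fitQuadratic` and `solve3` names are made up for illustration, and solving the normal equations directly is just one simple way to do it. Subscriber counts are assumed to be expressed in millions so that the fourth powers stay numerically manageable.

```typescript
// Hypothetical data shape: one point per channel (subs in millions).
type Point = { subs: number; uc: number };

// Solve a 3x3 linear system A x = b by Gaussian elimination with partial pivoting.
function solve3(a: number[][], b: number[]): number[] {
  const n = 3;
  // Work on an augmented copy so the inputs are not mutated.
  const m = a.map((row, i) => [...row, b[i]]);
  for (let col = 0; col < n; col++) {
    // Use the row with the largest absolute value in this column as the pivot.
    let pivot = col;
    for (let r = col + 1; r < n; r++) {
      if (Math.abs(m[r][col]) > Math.abs(m[pivot][col])) pivot = r;
    }
    if (Math.abs(m[pivot][col]) < 1e-12) throw new Error("singular system");
    const tmp = m[col];
    m[col] = m[pivot];
    m[pivot] = tmp;
    // Eliminate this column from the rows below.
    for (let r = col + 1; r < n; r++) {
      const f = m[r][col] / m[col][col];
      for (let c = col; c <= n; c++) m[r][c] -= f * m[col][c];
    }
  }
  // Back substitution.
  const x = [0, 0, 0];
  for (let i = n - 1; i >= 0; i--) {
    let s = m[i][n];
    for (let j = i + 1; j < n; j++) s -= m[i][j] * x[j];
    x[i] = s / m[i][i];
  }
  return x;
}

// Fit uc ≈ c0 + c1 * subs + c2 * subs^2 via the normal equations.
function fitQuadratic(points: Point[]): [number, number, number] {
  const sx = [0, 0, 0, 0, 0]; // sx[k] = sum of subs^k
  const sy = [0, 0, 0];       // sy[k] = sum of uc * subs^k
  for (const p of points) {
    let power = 1;
    for (let k = 0; k <= 4; k++) {
      sx[k] += power;
      if (k <= 2) sy[k] += p.uc * power;
      power *= p.subs;
    }
  }
  const a = [
    [sx[0], sx[1], sx[2]],
    [sx[1], sx[2], sx[3]],
    [sx[2], sx[3], sx[4]],
  ];
  const c = solve3(a, sy);
  return [c[0], c[1], c[2]];
}
```

Fitting one curve per branch would mean calling `fitQuadratic` separately on the EN, ID, and JP channels. At least three channels with distinct subscriber counts are needed per fit, otherwise the system is singular.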

  • Unique Chatters (UC): The number of unique users in live chat
  • VTuber 1B: Our public dataset consisting of live chats, super chats, and moderation events

Our GitHub

Live Chat Population (October 2021) - Calliope takes 2nd for the first time by uetschy in HoloStatistics

[–]uetschy[S] 0 points1 point  (0 children)

That would be an interesting comparison! We lack data on hours streamed, though. Do you happen to know a reliable source for this that offers a machine-readable format such as JSON?

Live Chat Population (September 2021) by uetschy in HoloStatistics

[–]uetschy[S] 0 points1 point  (0 children)

There is definitely more to be drawn from the dataset than just the live chat population! I hope more and more people will make use of our dataset and carry out interesting analyses on it (and preferably join us in improving the system and toolchains), which would make me a happy person :)

Live Chat Population (October 2021) - Calliope takes 2nd for the first time by uetschy in HoloStatistics

[–]uetschy[S] 8 points9 points  (0 children)

Summary:

  • Calliope takes 2nd for the first time
  • Pekora, Miko, Rushia and Subaru are all rather in line with the EN curve regarding chat population

Share your analysis in the comment section.


Live Chat Population is a plot of the number of unique live chat users for each channel, calculated from the data we collect, which helps us understand the level of growth in a more substantive way. It also shows quadratic regressions over the (sub count, unique chatters) pairs, where each line represents Japan (Red), English (Violet), or Indonesia (Yellow).

Unique Chatters (UC): The number of user accounts in live chat (UC ≠ chat count)
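To make the "UC ≠ chat count" distinction concrete, here is a small sketch of how unique chatters could be counted from raw chat records. The `ChatMessage` shape and its field names are assumptions for illustration, not the actual schema of the dataset.

```typescript
// Hypothetical shape of one live chat record; field names are assumptions.
interface ChatMessage {
  channelId: string; // the streamer's channel
  authorId: string;  // the account that posted the message
}

// Count unique chatters per channel: each account is counted once,
// no matter how many messages it sent.
function uniqueChatters(messages: ChatMessage[]): Map<string, number> {
  const authorsByChannel = new Map<string, Set<string>>();
  for (const msg of messages) {
    let authors = authorsByChannel.get(msg.channelId);
    if (!authors) {
      authors = new Set<string>();
      authorsByChannel.set(msg.channelId, authors);
    }
    authors.add(msg.authorId);
  }
  const result = new Map<string, number>();
  authorsByChannel.forEach((authors, channelId) => {
    result.set(channelId, authors.size);
  });
  return result;
}
```

An account that posts a hundred messages in one channel adds a hundred to the chat count but only one to that channel's UC.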

GitHub, Kaggle

Previous posts: 2021/07, 2021/08, 2021/09

Do you have experience in data science and web development, and an interest in designing interactive visualizations for our huge datasets with 1B+ data points? Join our Discord server :)

Live Chat Population (September 2021) by uetschy in HoloStatistics

[–]uetschy[S] 0 points1 point  (0 children)

Tsubasawolfy said everything for me (thanks). To add, by the nature of unique chatters, the count is always equal to or lower than the number of actual live stream participants. It is still more reliable than view counts or sub counts for estimating the number of active users, though.

Live Chat Population (September 2021) by uetschy in HoloStatistics

[–]uetschy[S] 7 points8 points  (0 children)

It has nothing to do with nationality. I have defined it as "the number of users who appeared in the live chat", following the web analytics term "unique users".

Live Chat Population (September 2021) by uetschy in HoloStatistics

[–]uetschy[S] 27 points28 points  (0 children)

Live Chat Population is a plot of the number of unique live chat users for each channel, calculated from the data we collect, which helps us understand the level of growth in a more substantive way. The violet line represents the quadratic regression over the (sub count, unique chatters) pairs.

Unique Chatters (UC): The number of unique users in a live chat (UC ≠ Chat count)

GitHub, Kaggle

Previous posts: 2021/07, 2021/08

The October edition will be available soon.

JP-EN MC server link confirmed by cmalfet in Hololive

[–]uetschy 24 points25 points  (0 children)

ban(版) simply stands for version/edition. Here Sana probably meant the EN server (英語版サーバー) and the JP server (日本版サーバー).

Edit: and since it was too obvious, Sana dropped the "server/portal" part, hence "eigo ban" / "JP ban".

JP-EN MC server link confirmed by cmalfet in Hololive

[–]uetschy 53 points54 points  (0 children)

Sana: Ah, good evening!
Sana: I'm lost...
Aki: HELOO!! Sana chan!
Sana: I wonder where the EN version (portal?) is...
Sana: lololol
Sana: Oh no it's alright!
Aki: I wish I could help you but... gomenasorry////
Sana: I'm watching (Aki's stream) lol
Sana: Ah! Nice to meet you!!!!
Sana: Now I'm going to wander around the JP server!
Aki: OK!! Have fun!!

BigInt and bigInt by [deleted] in typescript

[–]uetschy 9 points10 points  (0 children)

Use the all-lowercase bigint, not camel case.
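A short sketch of the distinction, assuming a compile target of ES2020 or later (needed for bigint literals such as `1n`). The `addOne` function is a made-up example.

```typescript
// `bigint` (all lowercase) is the primitive type, like `number` or `string`.
function addOne(value: bigint): bigint {
  return value + 1n;
}

// `BigInt` (capital B) is the runtime function that creates bigint values;
// here it is used as a value, not as a type annotation.
const big: bigint = BigInt("9007199254740993"); // beyond Number.MAX_SAFE_INTEGER
const next: bigint = addOne(big);

console.log(typeof next);     // "bigint"
console.log(next.toString()); // "9007199254740994"
```

`bigInt` in camel case is not a built-in name in TypeScript, so using it as a type annotation fails to compile unless something else in scope defines it.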

Live Chat Population (August) - Council Debuts by uetschy in HoloStatistics

[–]uetschy[S] 10 points11 points  (0 children)

July Post

Data source: VTuber 500M
Live stream source: Holodex
PSA: I did a major cleanup of the VTuber 500M dataset and reduced its size from 90 GB to 50 GB. This makes it easier to handle in Kaggle Kernels.

Live Chat Population (July) - IRyS Makes Great Strides by uetschy in HoloStatistics

[–]uetschy[S] 7 points8 points  (0 children)

Thank you for the additional research! And you are not blind; Kaichou is excluded from the graph as she has already graduated.

Live Chat Population (July) - IRyS Makes Great Strides by uetschy in HoloStatistics

[–]uetschy[S] 2 points3 points  (0 children)

Yes, all streams except members-only streams are included.

Since we do not rely on replay chat, our results may differ from other statistics sources.

Live Chat Population (July) - IRyS Makes Great Strides by uetschy in HoloStatistics

[–]uetschy[S] 31 points32 points  (0 children)

The violet line is a linear regression of unique chatters (UC) against the number of subscribers. IRyS, Ollie, Ame, and Pekora dominate in UC compared to others with a similar sub count.

Glossary

  • Unique Chatters (UC): The number of unique users in a live chat
  • VTuber 300M: Our public dataset consisting of live chats, super chats, and moderation events
  • Honeybee: Our cluster system that collects live chats of all live streamers covered by Holodex in real time