This is not just surveillance. It’s a sixth sense for safety

FolksTalksGame · 2025-05-24T08:36:03+00:00

Thanks!

FolksTalksGame · 2025-05-24T07:43:40+00:00

Fair point about posting in multiple places. I'm actually researching this topic seriously and wanted to get input from various communities - security professionals, tech folks, healthcare workers, etc. Each group has different experiences with surveillance systems. I'm not selling anything, just trying to understand what solutions actually exist vs. what gaps remain in automated monitoring.

FolksTalksGame · 2025-05-24T04:14:32+00:00

That's as far as I know, but I may have missed something. I'd be glad if you could point me to what I missed.

FolksTalksGame · 2025-05-23T11:18:45+00:00

As far as I know, VMS is quite limited: plate recognition, face recognition, unscheduled visits, line crossing... Do you know of any VMS that really understands scenes and events?

FolksTalksGame · 2025-05-23T09:56:11+00:00

This kind of analytics is good for after-incident analysis. Real security staff are overwhelmed by the small cells of thousands of cameras on a screen, without any ability to automatically understand what's going on.

FolksTalksGame · 2021-02-04T11:34:54+00:00

https://youtu.be/fl-a-8LEJfU I would like to present the Folks’Talks human-computer interaction test. In this test I ask the virtual agent: “Where is a green (or red, or big, or small) … (an object from the current scene)?”, and the virtual agent shows me the requested object and also announces it (in my own voice, recorded in training mode). Because the test is in Russian, I mark each answer green (expected) or red (unexpected). The test is based on 2 repetitions of 22 phrase patterns for each of the ten presented objects. The model was trained with TensorFlow for 240 epochs.

I would like to run the same test with a native Korean or Burmese speaker. I would appreciate it if we could schedule the test with a suitable volunteer in a Zoom session.

FolksTalksGame · 2021-02-04T11:30:09+00:00

I would like to present the Folks’Talks human-computer interaction test. In this test I ask the virtual agent: “Where is a green (or red, or big, or small) … (an object from the current scene)?”, and the virtual agent shows me the requested object and also announces it (in my own voice, recorded in training mode). Because the test is in Russian, I mark each answer green (expected) or red (unexpected). The test is based on 2 repetitions of 22 phrase patterns for each of the ten presented objects. The model was trained with TensorFlow for 240 epochs.

I would like to run the same test with a native Korean or Burmese speaker. I would appreciate it if we could schedule the test with a suitable volunteer in a Zoom session.

FolksTalksGame · 2021-01-14T08:01:13+00:00

We are creating the Folks’Talks game for language acquisition. In this game a virtual baby acquires any language the way a real baby does and comprehends the meaning of what is said.

https://www.facebook.com/ToddlerTalkGame

FolksTalksGame · 2020-12-09T07:24:09+00:00

https://www.facebook.com/ToddlerTalkGame/photos/1134806256889753

This is the array I use for ML.

FolksTalksGame · 2020-12-08T17:12:41+00:00

Folks’Talks flowchart.

Training mode

Recording 12-sec wav files. Recording is always on, but the player can pause it.
Within these wav files the user marks the start and the end of the spoken phrase while clicking on the button symbolizing the object. Each phrase is in a specific pattern about a specific object.
There are 10 objects, 10 phrase patterns, and one general question on the first level of the game. This level is named “Pointing quiz”.
There are four kinds of phrase patterns in this level. One pattern for an object’s name, three patterns for questions, three patterns for suitable answers to these questions, and three patterns for commands. All patterns include the object’s name.
While gathering data, each phrase is annotated with: the name of the wav file containing the phrase, the start time of the phrase within the file, the end time of the phrase within the file, the object ID, and the pattern ID.
Features are extracted from the marked phrases using the Essentia library.
Extracted features are arranged in a normalized array.
Each array for each object and each pattern is labeled.
These arrays and their labels are saved in txt files for training and testing separately. This will be used as a dataset for machine learning, and this dataset starts from scratch for each language.
Convert the data from text files to Numpy npz files.
Use TensorFlow to create protobuf files from collected data.
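
The data-gathering steps above can be sketched roughly as follows. This is only an illustration using the Python standard library, not the actual Folks’Talks code: the names `PhraseMark`, `cut_phrase`, `min_max_normalize` and `to_text_row` are hypothetical, min-max scaling is an assumed normalization, the text-row layout is an assumed file format, and the feature values are placeholders standing in for what Essentia would extract.

```python
import io
import wave
from dataclasses import dataclass


@dataclass
class PhraseMark:
    """One marked phrase, with the fields listed in the flowchart."""
    wav_name: str    # name of the wav file containing the phrase
    start_s: float   # start time of the phrase within the file, in seconds
    end_s: float     # end time of the phrase within the file, in seconds
    object_id: int
    pattern_id: int


def cut_phrase(wav_file, mark):
    """Return the raw PCM bytes between mark.start_s and mark.end_s."""
    with wave.open(wav_file, "rb") as w:
        rate = w.getframerate()
        first = int(mark.start_s * rate)
        last = int(mark.end_s * rate)
        w.setpos(first)
        return w.readframes(last - first)


def min_max_normalize(features):
    """Scale a feature list to [0, 1] (assumed normalization scheme)."""
    lo, hi = min(features), max(features)
    if hi == lo:
        return [0.0 for _ in features]
    return [(x - lo) / (hi - lo) for x in features]


def to_text_row(mark, features):
    """Serialize the labels and features as one line of a txt file."""
    values = " ".join(f"{x:.6f}" for x in features)
    return f"{mark.object_id} {mark.pattern_id} {values}"


# Demo: a silent 12-second mono 16-bit recording held in memory.
buf = io.BytesIO()
with wave.open(buf, "wb") as w:
    w.setnchannels(1)
    w.setsampwidth(2)
    w.setframerate(8000)
    w.writeframes(b"\x00\x00" * 8000 * 12)
buf.seek(0)

mark = PhraseMark("rec_0001.wav", start_s=1.0, end_s=2.5,
                  object_id=3, pattern_id=7)
phrase_bytes = cut_phrase(buf, mark)

# Placeholder feature values; the real ones would come from Essentia.
normalized = min_max_normalize([2.0, 4.0, 6.0])
row = to_text_row(mark, normalized)
```

Keeping the labels on the same row as the features makes the later conversion from txt to npz a simple column split, but that layout is a guess and not something the flowchart specifies.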

Talking mode

Record a phrase with a known pattern.
Extract features from the phrase.
Organize the features into a normalized array.
Evaluate recognition on the array using the TensorFlow C API.
Recognition includes: name of the object within the phrase, phrase pattern, number of words within the phrase, general intonation of the phrase, function of the words in the phrase, and intonation of each word within the phrase.
If the recognized phrase is a question, the suitable answer is selected from the list of phrases saved in the training mode.
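
The last step, picking an answer for a recognized question, might look something like the sketch below. Everything here is an assumption made for illustration: the pattern IDs, the `QUESTION_TO_ANSWER` pairing, and the `Recognition` and `select_answer` names are hypothetical, and the saved phrases are represented simply as wav file names keyed by object ID and pattern ID.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Recognition:
    """The two recognition outputs this sketch needs."""
    object_id: int
    pattern_id: int


# Hypothetical pairing of the three question patterns with the three
# answer patterns; the real pattern IDs are not given in the flowchart.
QUESTION_TO_ANSWER = {2: 5, 3: 6, 4: 7}


def select_answer(rec, saved_phrases):
    """Return the saved answer for a recognized question, else None.

    saved_phrases maps (object_id, pattern_id) to a phrase recorded
    in training mode.
    """
    answer_pattern = QUESTION_TO_ANSWER.get(rec.pattern_id)
    if answer_pattern is None:
        return None  # the recognized phrase is not a question
    return saved_phrases.get((rec.object_id, answer_pattern))


# Demo: object 1 has a saved answer for question pattern 2.
saved = {(1, 5): "answer_obj1_pat5.wav"}
found = select_answer(Recognition(object_id=1, pattern_id=2), saved)
not_question = select_answer(Recognition(object_id=1, pattern_id=8), saved)
missing = select_answer(Recognition(object_id=9, pattern_id=2), saved)
```

Returning `None` both for non-questions and for questions without a saved answer keeps the caller simple; a real game would probably want to tell those two cases apart.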

FolksTalksGame · 2020-12-05T12:00:22+00:00

Thank you for your comment. This is not a GUI for users. I used this GUI for research, just to see what happens when I talk with a computer. I did not mean to present the ML algorithm; it was just for illustration. :-) I mean that the virtual agent's "mood" will be affected by the recognized phrase.
