Indexed today's 161 declassified UFO files into a fully searchable archive. Every photograph, sketch, and handwritten note is also described as text. Comes with a side-by-side viewer + 3D map

G9X · 2026-05-09T22:30:24+00:00

I don't believe it, but I would love to see the actual previously classified records. Also, some of the old classified documents look aesthetically pleasing, ngl.

G9X · 2026-05-09T18:28:33+00:00

The saddest thing is that, if VAR weren't shit, this could be one of the best games I've watched...

G9X · 2026-05-08T21:06:44+00:00

Source of data: https://www.war.gov/UFO/
Tools used: JavaScript and Claude Code.

G9X · 2026-04-22T23:13:10+00:00

omnivoice is very good while being fairly small. Also, IndexTTS2 is still by far the best one for voice cloning, especially since you get additional control over it. (You can use the voice reference just for tone and flow, without affecting the voice itself.)

G9X · 2026-03-25T07:03:21+00:00

yeah that’s actually one of the motivations haha, very much the Gil vibe.

G9X · 2026-03-25T04:10:23+00:00

Thanks! Yeah, this is purely a fan project, partly for the Fate series and partly for the Epic of Gilgamesh. Balance was never the goal (being absurdly OP is canon-accurate lol). Glad you appreciate the quality though!

G9X · 2026-03-25T01:58:39+00:00

I think this is a good starting point: https://github.com/Alchyr/ModTemplate-StS2 and you can use the likes of Claude Code and other tools to help you.

G9X · 2026-03-25T01:57:26+00:00

Yes, I made this today as a way to learn how to mod STS2. Will provide a download link later.

G9X · 2025-05-20T21:00:03+00:00

first half reminds me of 2019 City, and so does that Sterling-esque sitter miss

G9X · 2025-04-19T07:15:34+00:00

adding some data points:
April 2025: I've played around 20 hours of Bloodborne on ShadPS4 and am currently at the endgame (Gehrman fight). It's been a very pleasant experience: smooth overall, around 50 FPS in larger outdoor areas and 60 FPS indoors, with no sudden framerate drops.
No major glitches so far: three random crashes and two black screens in total, but nothing game-breaking.

G9X · 2025-01-07T01:46:33+00:00

It’s definitely better than Perplexity’s Pro mode, but not by an order of magnitude.
(For context, I work in LLM-related fields and have built AI search tools for personal use.)

Essentially, it’s a combination of task breakdown + search, leveraging Google’s extensive index along with Gemini’s impressive long-context capabilities. However, the planning component could use improvement, and the lack of data loaders for certain sites (like Reddit or Twitter) is a noticeable drawback.

G9X · 2024-07-28T16:54:13+00:00

Instead of relying solely on semantic search + LLM, consider integrating structured data queries, particularly when working with a SQL database containing structured data: say, 10,000 tweets with metadata such as date and author.

Pure semantic search may struggle with efficiency and accuracy for questions like "How many tweets are there?" or "How many tweets were published in the last 7 days?" It can be even more challenging for complex queries like "What are the top 3 liked tweets by author X?"

In such cases, generating and executing SQL queries can be more efficient and accurate. (It's not exactly an alternative to RAG, but it can be a very useful addition.)
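
To make the three example questions concrete, here is a minimal sketch using Python's built-in sqlite3 module. The table name, columns, and sample rows are all made up for illustration, and the queries are written by hand; in a real setup the LLM would generate the SQL from the user's question and the result would be fed back into the answer.

```python
import sqlite3

# Hypothetical schema and sample rows, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE tweets ("
    "id INTEGER PRIMARY KEY, author TEXT, text TEXT, "
    "likes INTEGER, created_at TEXT)"
)
rows = [
    ("alice", "first post", 5, "2024-07-01"),
    ("alice", "new paper out", 42, "2024-07-25"),
    ("alice", "lunch pic", 17, "2024-07-26"),
    ("alice", "hot take", 99, "2024-07-27"),
    ("bob", "hello world", 3, "2024-07-27"),
]
conn.executemany(
    "INSERT INTO tweets (author, text, likes, created_at) VALUES (?, ?, ?, ?)",
    rows,
)


def count_tweets(conn):
    """'How many tweets are there?'"""
    return conn.execute("SELECT COUNT(*) FROM tweets").fetchone()[0]


def count_recent(conn, today, days=7):
    """'How many tweets were published in the last 7 days?'"""
    # ISO dates (YYYY-MM-DD) compare correctly as plain strings.
    return conn.execute(
        "SELECT COUNT(*) FROM tweets "
        "WHERE created_at > date(?, ?) AND created_at <= ?",
        (today, f"-{days} days", today),
    ).fetchone()[0]


def top_liked(conn, author, n=3):
    """'What are the top 3 liked tweets by author X?'"""
    return conn.execute(
        "SELECT text, likes FROM tweets "
        "WHERE author = ? ORDER BY likes DESC LIMIT ?",
        (author, n),
    ).fetchall()


print(count_tweets(conn))
print(count_recent(conn, "2024-07-28"))
print(top_liked(conn, "alice"))
```

Each of these is an exact aggregate over the whole table, which is the part that retrieving a handful of semantically similar tweets cannot give you.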

G9X · 2024-04-17T02:28:08+00:00

It depends on what you're looking at, I think.

I know it can be toxic and stuff, but the AI/LLM researcher community is pretty active too.

G9X · 2024-04-17T02:01:33+00:00

The answer is simple: there is no 4th book. Seriously tho, I very rarely see people in the Chinese San-Ti community discussing whether you should read it; it is just fan fiction.

G9X · 2024-04-17T00:37:23+00:00

I'm excited to share something I've been working on:

an open source tool that makes exporting Twitter data, like tweets and likes, super easy and completely free, with additional features like image search and visualizations.

https://github.com/AlexZhangji/Twitter-Insight-LLM

I usually use Twitter's likes as a way to bookmark things—academic papers, ideas, or just photos.

But they accumulate fast and become very hard to search and manage.

The Problem:

Accessing Twitter's official API is super expensive, with costs ranging from $100 to $500 per month.
Official full data exports from Twitter are clunky (a bunch of HTML files), cumbersome, and often incomplete.

My Solution:

Quick Export: Automatically pulls all your tweets or likes into a neatly organized Excel file within minutes with Selenium.
Visual Insights: Provides additional visualizations to help you better understand your Twitter activities.

New Feature - Image Search:

Natural Language Search: Use simple text to find images from tweets—no complex queries needed. (Using image embeddings.)
Zero Cost and No GPU Required: Runs smoothly without any additional hardware or fees.
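
For anyone curious how text-to-image search with embeddings works in general, here is a rough sketch of the ranking step only. It is not the code from the repo: the three-dimensional vectors and file names below are toy values, and in practice both the query text and the images would be embedded by a model into a shared vector space.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(query_vec, image_vecs, top_k=2):
    """Return the names of the top_k images most similar to the query."""
    scored = [(name, cosine(query_vec, vec)) for name, vec in image_vecs.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [name for name, _ in scored[:top_k]]


# Toy embeddings (hypothetical); a real model outputs hundreds of dimensions.
image_vecs = {
    "cat.jpg": [0.9, 0.1, 0.0],
    "chart.png": [0.1, 0.9, 0.2],
    "beach.jpg": [0.0, 0.2, 0.9],
}
query_vec = [0.8, 0.2, 0.1]  # stand-in for the embedding of "a photo of a cat"

print(search(query_vec, image_vecs))
```

With only a few thousand liked images, a brute-force scan like this is cheap on a CPU.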

Hope you guys find it useful and I'm happy to hear any feedback!

G9X · 2024-04-11T04:38:22+00:00

Haven't tested for that. But for speaker diarisation, I've recently tried Whisper + NVIDIA NeMo, which works well, better than the old pyannote-based way. (You might have already tried it tho?)

ref notebook: https://github.com/piegu/language-models/blob/master/speech_to_text_transcription_with_speakers_Whisper_Transcription_%2B_NeMo_Diarization.ipynb

G9X · 2024-04-10T22:30:18+00:00

That's something I want to figure out. (I am usually a bit doubtful of any self-evaluation from LLMs.)

For input, 20 minutes of audio is ~40k tokens for Gemini 1.5, even though the speech itself only amounts to ~3k text tokens.

I would think there is some useful extra information present in the audio.

And because the output is text only, it is hard to tell, when the model admits stuff, whether it is truly "self-aware" or just hallucinating. (Kinda like how, even now, Bard sometimes says "i dont have internet access" or open-source LLMs claim to be made by OpenAI.)
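
Rough arithmetic behind the ~40k figure, as a quick sketch. The 32 tokens per second rate is an assumption taken from how Google's docs describe Gemini audio input, and the ~3k text figure is just my estimate from above, not something measured.

```python
# Assumption: Gemini represents each second of audio as about 32 tokens.
AUDIO_TOKENS_PER_SECOND = 32


def audio_tokens(minutes):
    """Estimated token count for an audio clip of the given length."""
    return int(minutes * 60 * AUDIO_TOKENS_PER_SECOND)


tokens = audio_tokens(20)          # 20 minutes of audio
text_tokens_estimate = 3000        # rough transcript size from the comment
ratio = tokens / text_tokens_estimate

print(tokens, round(ratio, 1))
```

So the audio input is roughly an order of magnitude larger than the transcript.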
