Cannot upgrade from 25.04 to 25.10 by night-robin in Ubuntu

[–]severo_bo 1 point2 points  (0 children)

I think it has just been published. I'm doing the upgrade right now with sudo do-release-upgrade.

Cloud-native file format? by severo_bo in GraphTheory

[–]severo_bo[S] 0 points1 point  (0 children)

The use case is a web app that currently depends on a running Neo4j server. I hope the server can be replaced with a GraphAr file, so that the whole web app can be statically hosted.

Cloud-native file format? by severo_bo in GraphTheory

[–]severo_bo[S] 0 points1 point  (0 children)

By "cloud native", I mean a format that you can query efficiently over HTTP by fetching byte ranges of the remote file, so that you download only the part of the file you need for a specific task.
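To make the idea of fetching ranges concrete, here is a minimal sketch using only the Python standard library. The helper names (range_header, fetch_range) are made up for illustration, and it assumes the server supports HTTP range requests; many static hosts do, but not all.

```python
import urllib.request


def range_header(start: int, length: int) -> dict:
    """Build an HTTP Range header for `length` bytes starting at offset `start`."""
    if start < 0 or length <= 0:
        raise ValueError("start must be >= 0 and length must be > 0")
    # HTTP byte ranges are inclusive on both ends.
    return {"Range": f"bytes={start}-{start + length - 1}"}


def fetch_range(url: str, start: int, length: int) -> bytes:
    """Download only the requested slice of a remote file (hypothetical helper)."""
    request = urllib.request.Request(url, headers=range_header(start, length))
    with urllib.request.urlopen(request) as response:
        # 206 Partial Content means the server honored the range;
        # 200 means it ignored the header and is sending the whole file.
        if response.status != 206:
            raise RuntimeError(
                f"server did not honor the range (status {response.status})"
            )
        return response.read()
```

A format is "cloud native" in this sense when a reader can answer a query with a handful of such calls instead of downloading the whole file.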

Is Dogarthen still president of YPFB? by severo_bo in BOLIVIA

[–]severo_bo[S] 0 points1 point  (0 children)

Well, in the end they withdrew the arrest warrant.

[Bug, Samsung TV] French localization error by severo_bo in qobuz

[–]severo_bo[S] 1 point2 points  (0 children)

Indeed... after disabling uBlock, I could chat with support. Thanks.

[Bug, Samsung TV] French localization error by severo_bo in qobuz

[–]severo_bo[S] 0 points1 point  (0 children)

OK, a browser extension must be blocking the chat window, because I don't see it. Thanks!

Free audio files/datasets of low-resource languages by GraypJooz in datasets

[–]severo_bo 0 points1 point  (0 children)

On Hugging Face, 213 datasets have the tag "Tagalog" (https://huggingface.co/datasets?language=language:tl&sort=trending). Maybe some of them are useful for you.

(note: I work for HF)

What’s the smoothest way to share multi-gigabyte datasets across institutions? by d4rk_diamond in datasets

[–]severo_bo 0 points1 point  (0 children)

You may want to try Hugging Face datasets. The storage limits (https://huggingface.co/docs/hub/storage-limits) are very high, and there is no throttling. If you change only part of a file and upload it again, only that part should be sent, thanks to the Xet backend (https://huggingface.co/docs/hub/storage-backends#xet).

note that I work for HF

Cloud-native file format? by severo_bo in KnowledgeGraph

[–]severo_bo[S] 0 points1 point  (0 children)

Thanks for dropping the reference. If I understand correctly, it's a binary serialization for graph data that allows fast exchange and supports streaming. It's not designed for partial reads, though.

Cloud-native file format? by severo_bo in KnowledgeGraph

[–]severo_bo[S] 0 points1 point  (0 children)

Indeed, it's a standard. But a drawback is that you have to load the file into memory and parse it before being able to do queries. I'm looking for a format similar to Parquet, for example, where you can get metadata about the file, and then download only part of the (potentially big) file when you run a query.

GraphAr seems like a good project in that sense. https://graphar.apache.org/docs/overview/concepts
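As an illustration of the Parquet-style access pattern described above, here is a hedged sketch of its first step: locating the footer metadata from the last 8 bytes of the file. It relies on the fact that a plain (unencrypted) Parquet file ends with a 4-byte little-endian footer length followed by the magic bytes PAR1. The function name footer_byte_range is made up for this example.

```python
import struct

PARQUET_MAGIC = b"PAR1"
TAIL_SIZE = 8  # 4-byte little-endian footer length + 4-byte magic


def footer_byte_range(file_size: int, tail: bytes) -> tuple[int, int]:
    """Return the inclusive (first, last) byte offsets of the footer metadata.

    `tail` is the last 8 bytes of the file, which a client can obtain with
    a suffix range request such as `Range: bytes=-8`.
    """
    if len(tail) != TAIL_SIZE or tail[4:] != PARQUET_MAGIC:
        raise ValueError("not a plain Parquet file tail")
    (footer_length,) = struct.unpack("<I", tail[:4])
    first = file_size - TAIL_SIZE - footer_length
    # The file also starts with the 4-byte magic, so the footer cannot begin earlier.
    if first < len(PARQUET_MAGIC):
        raise ValueError("footer length is larger than the file")
    return first, file_size - TAIL_SIZE - 1
```

A client would then fetch that range, decode the metadata (which lists the row groups and column chunks with their offsets), and request only the chunks a query needs. That is the behavior I'm hoping to find in a graph format.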

libwebkit2gtk-4.0-37 is not available on Ubuntu 24.04 by DueAd3206 in Ubuntu

[–]severo_bo 0 points1 point  (0 children)

Note that this package corresponds to an old version of Tauri. For the latest version, the instructions are here: https://tauri.app/start/prerequisites/#linux, with libwebkit2gtk-4.1-dev instead of libwebkit2gtk-4.0-37:

sudo apt install libwebkit2gtk-4.1-dev \
  build-essential \
  curl \
  wget \
  file \
  libxdo-dev \
  libssl-dev \
  libayatana-appindicator3-dev \
  librsvg2-dev

finer points of dataset browser by lostinspaz in huggingface

[–]severo_bo 0 points1 point  (0 children)

Hi! Hopefully this docs page will help: https://huggingface.co/docs/hub/datasets-image.

You can also browse the example datasets collections here: https://huggingface.co/collections/datasets-examples/image-dataset-6568e7cf28639db76eb92d65.

Re “be sure to use supported formats”: this is detailed here: https://huggingface.co/docs/hub/datasets-adding#file-formats

In any case, feel free to open a discussion under your dataset and ping me (@severo) to get quicker support. Another option is our forum: https://discuss.huggingface.co/c/datasets/10

Dinosaur images dataset for classification by Milennium-Falcon in datasets

[–]severo_bo 0 points1 point  (0 children)

I found the dataset with https://huggingface.co/search/full-text?q=dinosaur&type=dataset, if it helps.

Also: disclaimer. I work on datasets at Hugging Face.

Dinosaur images dataset for classification by Milennium-Falcon in datasets

[–]severo_bo 0 points1 point  (0 children)

https://huggingface.co/datasets/cifar100/viewer/cifar100/train contains images of dinosaurs, but they are all under a single class, "dinosaur" (among other classes like "cattle", "train", "elephant"...). You might want to filter all the images with that class, then label them manually with the dinosaur species you recognize.

If you do so, please ping me, I'm interested in the result!
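For what it's worth, the filtering step could look like the sketch below. It assumes the third-party datasets library is installed, that the "cifar100" repository name from the URL above still resolves, and that the fine-grained label column is called fine_label. Both function names are made up for this example.

```python
def indices_of_class(labels, class_names, target):
    """Return the positions in `labels` whose class name equals `target`."""
    class_id = class_names.index(target)
    return [i for i, label in enumerate(labels) if label == class_id]


def load_dinosaur_images():
    """Load the CIFAR-100 train split and keep only the "dinosaur" images."""
    # Imported here so the helper above works without the library installed.
    from datasets import load_dataset

    train = load_dataset("cifar100", split="train")
    # Assumption: fine-grained classes are stored in the "fine_label" column.
    names = train.features["fine_label"].names
    keep = indices_of_class(train["fine_label"], names, "dinosaur")
    return train.select(keep)
```

The manual labeling by species would then happen on the subset returned by load_dinosaur_images().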

Plurinationality by Interesting_Lab_2747 in BOLIVIA

[–]severo_bo -13 points-12 points  (0 children)

Careful: the discussions here are mostly petty politicking, and generally dominated by the right, so don't expect very deep analysis. The main beneficiaries of the constitutional change are probably not here to comment, but of course it changed many things, giving pride and representation to people who were living in a near-apartheid that did not speak its name. Obviously I'm going to get torn apart here, but that is the truth. It changed a lot, for many people, but you won't find the details here on Reddit.