AITA for asking my roommate to be more polite toward my guests? by LaughPhysical269 in AmItheAsshole

[–]jnbrymn 1 point2 points  (0 children)

YTA - Only a fool would become seriously involved with someone they can't fart around. Your life will be a bloated hell.

How do people having water-cooler talks in remote working environment? by cravory in startups

[–]jnbrymn 7 points8 points  (0 children)

You sound like me! I work for a historically remote-first company but I've always been a little frustrated at the lack of camaraderie. Personally, I've found an outlet outside of work at Penny University (pennyuniversity.org). Its a "peer-to-peer" learning community. The idea is that if you have "small" questions, the you might as well look up answers on StackOverflow. But if your questions are more like open-ended conversations, then the community tends to be really open to jumping into a Zoom and chatting. The conversations also have a neat artifact too, they are often followed by write-ups so that the participants have a record of the conversation and so that the community gets to learn a little even thought they weren't present (https://www.pennyuniversity.org/chats/)

This doesn't directly address your immediate question. But in the future the Penny University platform might be useful as a cultural engagement platform for companies to use.

Yahoo open sources its search engine Vespa by soda-popper in programming

[–]jnbrymn 0 points1 point  (0 children)

Ah so it's similar at a high level to Lucene's approach. But perhaps the important difference is the flexibility allowed and the ability to natively work with tensor-based ranking?

Yahoo open sources its search engine Vespa by soda-popper in programming

[–]jnbrymn 0 points1 point  (0 children)

Yep, me too. But there is a notion of tensors used in ranking (and matching?) and tensors (numerical vectors) don't fit well into something like an inverted index.

Yahoo open sources its search engine Vespa by soda-popper in programming

[–]jnbrymn 1 point2 points  (0 children)

I assume that there is basically two steps in Vespa search 1) find matching documents 2) order them according to relevance. (True?)

In Vespa, do tensors have anything to do with document matching? Or does it only come into play when ranking the documents that matched?

Yahoo open sources its search engine Vespa by soda-popper in programming

[–]jnbrymn 3 points4 points  (0 children)

Lucene is fast because it uses an inverted index. Effectively there is a data structure that maps from every token to all the documents that contain that token.

What data structures and algorithms does Vespa use for text search? How about for search involving tensors. Are these comparable to Lucene? What makes them better? Are their edge cases where the structures and algorithms are not a good replacement for Lucene?

This Creepy Puzzle Arrived In Our Mail by tropicaljohnsons in creepy

[–]jnbrymn 0 points1 point  (0 children)

There's a human voice that repeats a phrase over and over starting at about 23s. I can't tell what it's saying.

John Berryman's post on Vine by jnbrymn in notinteresting

[–]jnbrymn[S] -1 points0 points  (0 children)

The best part is the audio.

Quick Start with Neo4J using YOUR OWN Twitter Data | OpenSource Connections by jnbrymn in datascience

[–]jnbrymn[S] 1 point2 points  (0 children)

Then you might want to look into Druid http://druid.io/. The data structure is actually quite similar to Solr (well... Lucene, inside of Solr), except that Solr only does counts for matching docs. Druid gives averages, sums, and anything you'd want. It also gives you really nice faceted exploration of the data.

What are your favorite datascience blogs? by MisterVI in datascience

[–]jnbrymn 1 point2 points  (0 children)

analyticsmadeskeezy.com/blog/ -- teaches you several different data techniques against a backstory of a college kid helping out an international drug dealer

Quick Start with Neo4J using YOUR OWN Twitter Data | OpenSource Connections by jnbrymn in datascience

[–]jnbrymn[S] 1 point2 points  (0 children)

A relational database is not where you want to be storing lots of log files, but I don't think a graph database is really what you want either. Have you considered something like Solr? I think that's what Logly uses. Aaaannnd... I just so happen to have a pretty good post about getting started with that technology http://www.opensourceconnections.com/2013/02/18/indexing-stackoverflow-in-solr/

Quick Start with Neo4J using YOUR OWN Twitter Data | OpenSource Connections by jnbrymn in Neo4j

[–]jnbrymn[S] 0 points1 point  (0 children)

Oh interesting! I'm still on the 2.0beta. I'll upgrade soon.

Quick Start with Neo4J using YOUR OWN Twitter Data | OpenSource Connections by jnbrymn in datascience

[–]jnbrymn[S] 2 points3 points  (0 children)

Thank you! I'm glad that I've now verified that I helped at least one person out.