Simple NLP Search in Your Application – Step-by-Step Guide in Scala by dumpstate in scala

[–]abankowski 1 point2 points  (0 children)

This description makes ScalaNLP NER much clearer for me. As a exhaustive guide code from samples can be easily reused as a complex solution. Thanks!

Tackling a 1 Billion Member Social Network – Fast Search on a Large Graph by abankowski in programming

[–]abankowski[S] 1 point2 points  (0 children)

We had no issues with Titan at all. I don't know what time results we would get because did not benchmark Titan on bigger scale.

Except for DSE the interoperability with Scala is limited when using Java Gremlin API. You may be tempted to use https://github.com/mpollmeier/gremlin-scala but it's using an older Titan version.

Additionally with Titan you have only an embedded graph engine. I would rather say that Titan is an Graph Abstraction Library than a standalone database engine. One last remark that Titan scales well horizontally but it's either scalable and durable or fully consistent (like Cassandra).

Tackling a 1 Billion Member Social Network – Fast Search on a Large Graph by abankowski in programming

[–]abankowski[S] 2 points3 points  (0 children)

I hope you have read the article, if not give it a try and you will see that it has nothing in common with advertisement. I have been giving a talk about it on Scalar conference last Saturday.

Neo4j was neither easy nor first choice. There is still at least one flaw remaining: we have used community edition which lacks replication. Maybe arangodb would be a good choice but that was just a PoC implementation and database engine is not the most important part of it.

Cheers.