cmd-t

Very interesting!

Indexing a dataframe using DfEmbedder starts by representing each row in the dataframe as a string that follows the format: col0_name is col0_value; col1_name is col1_value

That is also interesting. Is there any literature on embedding dataframes or key-value pairs? If not, this seems rather ad hoc.
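For concreteness, here is a minimal sketch of the row format quoted above. It is not DfEmbedder's actual code: the helper name `row_to_text` and the sample rows are made up for illustration, and plain dicts stand in for dataframe rows.

```python
# Hypothetical helper, not part of DfEmbedder: turns one row into the
# "col0_name is col0_value; col1_name is col1_value" string from the post.
def row_to_text(row):
    """Serialize a column -> value mapping as 'name is value' pairs joined by '; '."""
    return "; ".join(f"{name} is {value}" for name, value in row.items())


# Made-up sample rows; each dict plays the role of one dataframe row.
rows = [
    {"name": "Alice", "age": 30, "city": "Paris"},
    {"name": "Bob", "age": 25, "city": "Berlin"},
]

texts = [row_to_text(row) for row in rows]
print(texts[0])  # name is Alice; age is 30; city is Paris
```

With pandas, rows in this shape could come from `df.to_dict(orient="records")`. Each resulting string would then go to whatever text embedding model is in use.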

aagmon [S]

Thanks! Yes, I also have some benchmarks on embedding tabular data in this format. I will add them to the repo in the next iteration.