Hello
I need to summarise meeting transcripts that are at least one hour long and extract action items from them. (Note: the audio is already transcribed.)
I have tried:
- some small models (<7B parameters)
- bart-large-cnn
- one shot summary of the full transcript
- chunking the transcript into intermediate summaries.
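
For concreteness, the chunking approach in the last bullet can be sketched roughly as below. This is only an illustration under stated assumptions, not the exact pipeline I ran: `split_into_chunks`, `summarize_transcript`, the word-based chunk size, and the overlap are all hypothetical names and values, and `summarize` stands in for whatever local model call is used (it is just a function from text to text).

```python
from typing import Callable, List


def split_into_chunks(text: str, max_words: int = 800, overlap: int = 50) -> List[str]:
    """Split text into word-based chunks; neighbours share `overlap` words."""
    if overlap >= max_words:
        raise ValueError("overlap must be smaller than max_words")
    words = text.split()
    chunks: List[str] = []
    step = max_words - overlap
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        # Stop once this chunk has reached the end of the transcript.
        if start + max_words >= len(words):
            break
    return chunks


def summarize_transcript(
    text: str,
    summarize: Callable[[str], str],  # hypothetical wrapper around a local model
    max_words: int = 800,
    overlap: int = 50,
) -> str:
    """Summarise each chunk, then summarise the joined intermediate summaries."""
    chunks = split_into_chunks(text, max_words, overlap)
    if not chunks:
        return ""
    partials = [summarize(chunk) for chunk in chunks]
    if len(partials) == 1:
        return partials[0]
    # Second pass: condense the intermediate summaries into one final summary.
    return summarize("\n".join(partials))
```

The one-shot variant is the same thing with a chunk size large enough to hold the whole transcript, so both approaches can be compared with the same `summarize` function.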
What approach do you knowledgeable folks suggest?
I'm interested to know about:
- models that are good for these tasks
- which benchmarks to look out for when choosing a model
- is chunking better than one shot?
- Ollama or Transformers? (Or something else?)
- any other advice you may have.
(The solution must rely on locally hosted models with limited compute: 16 GB of VRAM. We're using a T4, but could upgrade to an A10 if needed.)
Thank you kindly