CCExtractor Development
Find out a way to get a current topic based on captions and implement it
Suppose you have a set of captions lines from a few minutes of newscast. When a person reads these captions he can immediately tell you which topic is being discussed, e.g. “it’s about US president election” or “it’s about a species of fish that can jump out of water ”. We want you to do some research on how to summarise a set of given captions lines to tell a topic of them. As this problem is well known and there are algorithms for it on the Internet, we also want you to find a good one and implement it.
Feel free to choose a language of your choice!
We expect for this task the code of the implementation you made.
Also check another related task which is called "Research on text-summarization algorithms in context of closed captions."
Task tags
Students who completed this task
Nikunj Taneja, Matej Plavevski