Loading
Hey guys! I made a Hugging Face dataset a little while ago consisting of 5000 podcasts, and was shocked to see it become the most downloaded conversation dataset on the platform. I’m proud of it, but also think that there is room for improvement. I was wondering if any of you can think of a way to make it more valuable, or if not, if there are any other datasets you may want to use that don’t exist yet. LLMs are the future, and I want to help the community as much as possible.
Link to Dataset: https://huggingface.co/datasets/ReadyAi/5000-podcast-conversations-with-metadata-and-embedding-dataset
submitted by /u/ready_ai
[link] [comments]