Loading
I recently ran a research and an evaluation of top LLMs on the MedQA dataset (Vals.ai, 09 May 2025). Normally these tests are multiple-choice questions plus five answer choices (A–E). They show the following: – o1 96.5 %, – o3
Not sure if anyone else has felt this, but most AI sales tools today feel… off. We tested a bunch, and it always ended the same way: robotic follow-ups, missed context, and prospects ghosting harder than ever. So we built
Found out that people are making entire games in UE using Ludus AI agent, and documenting the process. Credit: rafalobrebski on youtube submitted by /u/SmalecMoimBogiem [link] [comments]
It’s not perfect, but it does a pretty good job. I’ve been running around testing it on different things. Here’s what I’ve found that it can recognize so far: -Clanging a knife against a metal french press coffee maker. It
SoundCloud changes policies to allow AI training on user content.[1] OpenAI agrees to buy Windsurf for about $3 billion, Bloomberg News reports.[2] Amazon offers peek at new human jobs in an AI bot world.[3] Visual Studio Code beefs up AI
Note: When I wrote the reply on Friday night, I was honestly very tired and wanted to just finish it so there were mistakes in some references I didn’t crosscheck before sending it the next day but the statements are
Hey people, so I’ve been seeing so many people getting stuck at ideation phases, And so many people who are inherently ambitious but don’t exactly know what to do with all of their fire, people who wish to take control
submitted by /u/katxwoods [link] [comments]
I’m trying to monitor the best sources for AI news. It seems to me most of this is happening on Twitter and Reddit. Would you agree? Am I missing somewhere? submitted by /u/brainhack3r [link] [comments]
Hey guys, just launched a fully open source alternative to wandb called mlop.ai, that is performant and secure (yes our backend is in rust). Its fully compatible with the wandb API so migration is just a one line change. WandB