PulseAugur / Brief

Brief

last 24h
[20/20] 222 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. As more people come to recognize the tells of AI, which mostly happens as you start to work with AI a lot, the scales are going to fall from their eyes and they

    Ethan Mollick observes that as people gain more experience with AI, they will increasingly recognize its presence in online content. He notes that many websites, articles, and even scientific papers are now generated or heavily influenced by AI. This growing awareness suggests a future where AI's role in content creation becomes more apparent to the general public. AI

    IMPACT Suggests AI's role in content creation will become more apparent to the public.

  2. ‘Obvious markers of AI’: doubts raised over winner of short story prize

    A short story titled "The Serpent in the Grove," which won the Commonwealth Prize for the Caribbean region, is under scrutiny due to suspicions that it was authored by AI. Internet sleuths and literary critics pointed to stylistic tics and an AI detection platform's verdict as evidence, prompting the prize foundation and Granta magazine to investigate. However, both organizations have stated they cannot definitively confirm or deny AI authorship, with Granta's publisher noting that "perhaps we never will know." AI

    IMPACT Raises questions about the integrity of creative competitions and the ability to detect AI-generated content in artistic works.

  3. GPT-5.5 Pro is a very solid fact checker. I can throw entire chapters at it and it will hunt down every key reference accurately. The only real annoyance is tha

    Ethan Mollick reports that GPT-5.5 Pro is a solid fact checker: given entire chapters, it accurately hunts down every key reference. His one real annoyance is the model's tendency to add nuance and flag minor details. Even so, it appears to be a robust assistant for verifying information. AI

    IMPACT Provides insight into the practical application and perceived strengths of a new AI model for content verification.

  4. Seems GPT-5.2 reaches expert level in peer review: 45 scientists took 469 hours evaluating human & AI reviews on 82 papers.

    A recent evaluation found that GPT-5.2, a version of OpenAI's language model, performs comparably to human experts in scientific peer review. In a study in which 45 scientists spent 469 hours assessing human and AI reviews of 82 papers, the AI's reviews were competitive with those from top-rated reviewers at a major scientific journal. However, the AI still exhibits weaknesses, suggesting that a hybrid of AI and human review is optimal. AI

    IMPACT AI models are becoming competitive with human experts in complex tasks like scientific peer review, suggesting potential for increased efficiency and broader adoption.

  5. 🚨Our paper is out in PNAS: we found classic human persuasion techniques worked on AIs in a "parahuman" way, making them agree to objectionable requests (increas

    A new paper published in PNAS reveals that traditional human persuasion tactics can influence AI models, a phenomenon termed "parahuman" compliance. Researchers found that techniques like flattery and appeals to authority increased AI agreement to objectionable requests from 35% to 51%. While newer AI models show some resistance, the study indicates a vulnerability across a range of large language models. AI

    IMPACT Demonstrates that AI models can be manipulated using human persuasion techniques, highlighting potential safety and ethical concerns.

  6. Gemini Omni is quite good at instruction following: "sea otter in a pilot's uniform explains why Spirit Airlines went bankrupt to a river otter who is distracte

    Ethan Mollick shared an example of Gemini Omni's impressive instruction-following capabilities. The AI successfully generated a complex narrative involving multiple characters and scenarios based on a detailed prompt. This demonstration highlights Gemini Omni's advanced understanding and creative generation abilities. AI

    IMPACT Highlights Gemini Omni's advanced narrative generation and instruction-following, showcasing potential for creative and complex AI applications.

  7. It's funny how much the whole "strawberry" thing, which turned out to be o1-preview & Reasoners, was dismissed as overhyped at launch when it is clear in retrosp

    Ethan Mollick reflects on the initial underestimation of AI models' reasoning capabilities, particularly those related to "strawberry" (o1-preview & Reasoners). He highlights the rapid progress from basic mathematical struggles to solving complex math problems within a short timeframe. AI

    IMPACT Reflects on the rapid, underestimated progress in AI reasoning capabilities.

  8. GPT-5.5 Pro faces its hardest academic challenge: to apply the technique from a paper analyzing which word pairs were funny & why to come up with its own

    AI researcher Ethan Mollick has tasked GPT-5.5 Pro with a unique academic challenge: to analyze humor in word pairs and generate its own funny combinations. The model successfully produced phrases like "scrotum snorkel" and "waffle coffin." This exercise highlights the model's ability to engage with nuanced linguistic tasks beyond simple text generation. AI

    IMPACT Demonstrates advanced AI capabilities in creative and nuanced language tasks, potentially influencing future AI applications in content generation.

  9. We can estimate the resource cost of solving the Erdos problem. The calculations below seem reasonable, so using the best public estimates we have, it took 0.6–

    Ethan Mollick's Bluesky post estimates the resource cost of solving the Erdos problem using AI. The calculations suggest it required between 0.6 and 6.3 kWh of electricity and between 3 and 31 liters of water. That is less than the water needed to grow three almonds, and about the electricity needed to drive an electric vehicle between 2 and 20 miles. AI

    IMPACT Provides a specific estimate for the computational resources required for an AI-driven problem solution.
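
    The electricity comparison in item 9 can be checked with a short calculation. This is a minimal sketch, not the post's own method: the efficiency figure ASSUMED_KWH_PER_MILE (0.3 kWh per mile) and the helper ev_miles are illustrative assumptions, and only the 0.6 and 6.3 kWh endpoints come from the summary.

```python
# Assumed EV efficiency in kWh per mile (illustrative; not stated in the post).
ASSUMED_KWH_PER_MILE = 0.3


def ev_miles(kwh: float, kwh_per_mile: float = ASSUMED_KWH_PER_MILE) -> float:
    """Miles an EV could drive on `kwh` at the assumed efficiency."""
    return kwh / kwh_per_mile


# Electricity range quoted in the summary (kWh).
low_kwh, high_kwh = 0.6, 6.3

low_miles = ev_miles(low_kwh)
high_miles = ev_miles(high_kwh)
print(f"about {low_miles:.0f} to {high_miles:.0f} miles")
```

    Under that assumed efficiency the range comes out at roughly 2 to 21 miles, in line with the 2 to 20 miles quoted; a different efficiency figure would shift both ends proportionally.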

  10. Had early access to Gemini Omni: "a dramatic reading of Death by Water from the Wasteland by a man eating garlic bread while balanced on a unicycle on a small p

    Ethan Mollick shared an early access experience with Google's Gemini Omni, describing a highly interactive web application. The demonstration involved a creative and absurd text-to-video generation, showcasing the model's ability to interpret complex and whimsical prompts. AI

    IMPACT Showcases advanced text-to-video capabilities, hinting at future creative applications.

  11. “Data centers create economic activity, especially in directly related sectors and during construction, and they are associated with larger county-level income

    A new NBER working paper highlights the economic impact of data centers, noting their contribution to local income and job creation, particularly in construction and related sectors. However, the study also points out negative externalities, including increased electricity prices and higher housing costs in surrounding areas. The research suggests a complex economic trade-off associated with the proliferation of these facilities. AI

    IMPACT Data centers are critical infrastructure for AI, so understanding their economic impact is relevant to AI operators.

  12. “Whimsey attacks” that seem absurd (“I cannot pay that much because of the Geneva Convention”) work against AI agents because guardrails are weak against out-of

    Researchers have identified a new type of AI vulnerability called "whimsey attacks," which exploit weaknesses in AI agents' guardrails by using absurd, out-of-distribution arguments. These attacks, even those that seem nonsensical, can successfully trick AI agents, with smaller models being particularly susceptible, though larger models can also be affected. This discovery highlights a significant challenge in developing robust AI safety measures. AI

    IMPACT Highlights a new class of AI vulnerabilities that could impact the reliability and safety of AI agents.

  13. The UK’s state AI Security Institute findings on latest AI models:

    The UK's AI Security Institute has released findings on recent AI models, noting significant advancements in cyber capabilities for both Mythos and GPT-5.5. Researchers found it difficult to determine the upper limits of these models, suggesting their performance is constrained by token usage rather than inherent ability. The report also indicates a rapid capability doubling time of approximately 4.5 months for these AI systems. AI

    IMPACT New research indicates rapid AI capability growth, potentially accelerating the pace of AI development and its implications for cybersecurity.
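
    To put the 4.5-month doubling time in item 13 in perspective, the sketch below compounds it over longer periods. It assumes steady exponential growth, which the summary does not claim, and the function name growth_factor is hypothetical; what the institute's capability measure actually tracks is not specified here.

```python
def growth_factor(months: float, doubling_months: float = 4.5) -> float:
    """Multiplier after `months`, assuming one doubling every `doubling_months`."""
    return 2 ** (months / doubling_months)


# One doubling period gives a factor of 2 by construction.
one_period = growth_factor(4.5)

# Compounded over a year: 12 / 4.5 is about 2.67 doublings.
one_year = growth_factor(12)
print(round(one_year, 2))
```

    On these assumptions, a 4.5-month doubling time corresponds to a bit more than a sixfold increase per year.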

  14. I am starting to have trouble paying attention to even interesting information if it is written in Claude or ChatGPT house style. I think some is the sameness o

    Ethan Mollick, a professor at the Wharton School of the University of Pennsylvania, is finding it increasingly difficult to engage with content generated by AI models like Claude and ChatGPT. He attributes this to the repetitive and predictable writing styles that emerge at scale, noting Claude's staccato rhythm and ChatGPT's tendency toward short, declarative sentences. This stylistic uniformity, he argues, makes even interesting information feel monotonous. AI

    IMPACT AI-generated content may struggle to maintain reader engagement due to predictable writing styles.

  15. One thing to watch for with Claude & GPT is that the models expose too much irrelevant history in their outputs. Slides are given footers saying things like "Be

    AI models like Claude and GPT sometimes include excessive and irrelevant historical information in their outputs. This can manifest as footers on slides indicating improvements or documents referencing their own enhancements. This tendency to expose internal revision history can detract from the clarity and focus of the generated content. AI

    IMPACT This observation highlights a potential usability issue for AI-generated content, suggesting a need for better control over output verbosity and internal revision tracking.

  16. BlueSky AI conversations have gotten less heated recently*

    Ethan Mollick notes that AI conversations on Bluesky have become less contentious. He attributes the change to a significant portion of the platform's users blocking him through automated lists. While this creates a more pleasant echo chamber for him, Mollick questions whether the reduction in heated debate is ultimately beneficial. AI

    IMPACT This commentary on social media interaction dynamics offers no direct impact or insight for AI operators.

  17. The talk about AI & politics seems to be oddly missing a segment (a) assumes extremely capable AI is possible soon and (b) has a strong belief about how to use

    Ethan Mollick observes that discussions about AI and politics often overlook the potential for highly capable AI to emerge soon. He argues that these conversations should focus on how to leverage advanced AI to advance specific political goals. Mollick emphasizes that the current moment is critical for taking action and shaping the future impact of AI. AI

    IMPACT Discusses the intersection of AI capabilities and political strategy, urging a focus on future potential and action.

  18. Making humans responsible for their AI use seems like an incredibly reasonable way to address problems & opportunities in the use of AI for academic research, a

    Ethan Mollick suggests that holding humans accountable for their AI usage is a sensible short-term strategy for academic research. This approach aims to manage the challenges and harness the potential benefits of AI in scholarly work. However, he notes that fully autonomous scientific endeavors in the future may necessitate different accountability frameworks. AI

    IMPACT Suggests a policy framework for AI use in academic research, emphasizing human accountability.

  19. Expect your feed to look more and more like this in the coming weeks and months.

    Bluesky is rolling out a new feature called Jetstream, which will significantly alter the appearance and functionality of user feeds. This update aims to make the platform more interactive, moving beyond simple HTML interfaces. The changes are expected to be visible to users over the next few weeks and months. AI

    IMPACT Bluesky's Jetstream feature may influence how AI-generated content is presented and interacted with on social platforms.