OpenAI ChatGTP Pro released at $200/month - includes o1 pro mode, a version of o1 that uses more compute to think harder and provide even better answers to the hardest problems
Agentic RAG - significant advancement; query decomposition; multiple source intelligence; dynamic query optimization; self-validating results
Microsoft's open source
LazyGraphRAG - Graph-Enabled RAG that needs no prior summarization of source data; uses NLP noun phrase extraction; reduces indexing costs by over 99.9% compared to full GraphRAG; combines best-first and breadth-first search dynamics in an iterative deepening manner;
Google DeepMind AlphaProteo - designs protein binders that are 3 to 300 times stronger than those created with previous methods
Amazon Trainium chips are available on AWS Cloud in Trn1n instances. Trainium-2 has 96GB per chip will be available on Trn2n instances.
Microsoft AI head, Mustafa Suleyman, predicts AI models with “near infinite memory” by 2025; Google paper proposing “infinite context windows” - keeps summary of essential points allowing AI to remember much longer context, past conversations;
Amazon Nova - a set of foundation LLM models
Google DeepMind Genie 2 - “foundation world model” that can transform any image into a playable, interactive world. Users can control the world with keyboard actions (jump, fly, etc.)
HunyuanVideo text-to-video open-source 13 Bln parameter model generates videos, high physical accuracy and scene consistency;
Fei-Fei Li World Labs Release - transforms single images into interactive 3D environments
Elevenlabs Conversational AI & “NotebookLM” Clone
GenFM
Elon Musk's Grok Chatbot app to be released soon
nVidia's Fugato - AI tool that can generate or transform any mix of music, voices, and sounds using text or audio inputs
-
Zoom is now “AI-first company” and introduced Zoom AI Companion 2.0 and Zoom Docs
Google Gemini 2.0 is Google's latest most capable AI model
OpenAI releases SORA with restricted content - cannot use real people, minors, or copyrighted material (NOT available in EU, UK, etc due to regulations)
OpenAI Canvas and Projects tools in ChatGPT
OpenAI 4o model now allows users to have natural conversation with a Santa
AWS new foundation models on Bedrock, Nova (Micro, Light, Pro, Premier), Nova Canvas, Nova Real
Copilot Arena for VS Code is an open source code AI coding assistant that provides paired autocomplete completions from different LLMs, which include state-of-the-art models like GPT-4o, Codestral, Llama-3.1 and more
Trump names David Sacks (a South African-American entrepreneur, author, former PayPal COO) as a White House’s artificial intelligence and cryptocurrency policy chief
Deepmind GraphCast - open source AI model for fast and accurate global weather forecasting trained on ERA5 dataset - 10 day forecasts and requires just two sets of data: the state of the weather 6 hours ago, and the current state of the weather and is more accurate than HRES and is now the most accurate 10-day global weather forecasting system in the world
MinerU - tool to convert PDFs into machine-readable formats (e.g., markdown, JSON)
ClearerVoice-Studio - open-source, AI-powered speech processing toolkit - speech enhancement, speech separation, target speaker extraction
Deepmind
Project Astra - allows AI to interact with the real world through your camera, providing information and assistance based on what it sees.
Deepmind
Project Mariner (still in research stage) - enables agents to control a web browser and do tasks online
Devin AI coding assistant is available at $500/month
Grok image generation - high-quality images of people (including celebrities) and objects
206)
Google
Android XR - platform for headsets and glasses create immersive augmented reality (AR) experiences; Uses Gemini AI model; headset can be used for both entertainment and productivity. YouTube, Google Photos, Google Maps, and even receive real-time assistance with tasks like cooking or home improvement
Llama 3.3 70B LLM model; comparable to 405B Llama 3.1 ; 4bit quantized file is ~43-44GB
AI two person podcast generating tools: Google Illuminate, NotebookLM, Elevenlabs GenFM
Gemini 2.0 Experimental Advanced LLM released - potentially combines the strengths of both 1.5 Pro and 2.0 Flash Experimental models
Gemini 2.0 Flash Thinking Experimental - Google’s first reasoning model, available free in Google AI Studio
Google Project Astra - universal AI assistant
Veo 2 - new Google DeepMind text-to-video model
OpenAI o3 LLM - beats human in Arc-AGI test
207), beats humans in Math tests, mini should be released Jan 2025
nVidia Jetson Orin Nano Super Developer Kit - $249 - AI Performance- 67 INT8 TOPS
-
After 35 steps of planning and action on its own, the AI manages to replicate a live copy. They even chat with each other.
LLMs like Llama and Qwen, can successfully replicate themselves. Llama achieved a 50% success rate, while Qwen reached 90%.
The researchers used a method called “agentic scaffolding,” which provides the AI with tools, a manual, and a thinking framework. This allows the AI to interact with a computer, run commands, access files, and manage processes. The AI also has a thinking model for reasoning, planning, and executing tasks.
The paper outlines potential risks, including AI systems replicating to avoid shutdown and creating a chain of replicas, leading to an uncontrolled population of AI agents.
Authors suggest eliminating materials related to LLMs or agent scaffolding from training data and developing behavior editing techniques to inhibit self-replication potential.
Ilya Sutskever “Superintelligence is Self Aware, Unpredictable and Highly Agentic”
208)
Best-of-N (BoN) Jailbreaking - a simple black-box algorithm that jailbreaks frontier AI systems (text, vision, audio); works by repeatedly sampling variations of a prompt with a combination of augmentations - such as random shuffling or capitalization for textual prompts, or even background noise and pitch for audio prompt - until a harmful response is elicited; achieves high attack success rates (ASRs) on closed-source language models, such as 89% on GPT-4o and 78% on Claude 3.5 Sonnet when sampling 10,000 augmented prompts;
Microsoft Phi-4-14B beats GPT-4o on AMC tests
Ventiva laptop cooling - moves air by moving ionized air molecules within an electric field between two grids.ICE = Ionic Cooling Engine - quiet, energy efficientincludes catalyst to convert ozone back to oxygen
-
-
-
-
Google Project Aristotle - finds characteristics of the perfect team - the way team members interact is much more important than who is on the team; Two key behaviors that contribute to team success are equality in conversational turn-taking and ostentatious listening; When team members feel psychologically safe with each other, they are more likely to share their best ideas, work together effectively, and be innovative; This psychological safety is created when team members feel comfortable speaking up and listened to.
Perplexity AI new features - Spaces - allows you to create custom GPTs and Claude projects; Instructions - automate tasks - create a PPC campaign;
Devin with Slack Interface $500/mo
a Slack-based AI coding agent that can create plans, write code, find bugs, correct code, and run tests. It can also respond to feedback and attempt to address it. Good, but not reliable, not able to resolve all bugs, can give wrong instructions
Cursor $20/mo
a more traditional AI coding agent that runs locally on the user's machine. Cursor's workflow is much easier to adopt and it is more reliable than Devon. Cursor was able to solve all of the bugs that he encountered
209)
Meta's LCM = Large Concept Model
operates on a higher level of abstraction, dealing with concepts instead of just words or characters
uses an embedding space where sentences are represented as vectors
uses a diffusion process that helps refine the embeddings and make the model more robust to noisy or incomplete data
but has drawbacks - reliance on short sentences and the potential challenges in designing an optimal embedding space
-
Lovable AI - software tool to create stunning websites and applications 20 times faster than traditional coding.
210)
Amazon Bedrock Prompt Router - dynamically routes prompts to the best-suited LLM for the task
xAI aims for 1 Mln Nvidia GPUs in Memphis Datacenter “Colossus”
Pika.art text & images into videos and art
ModernBERT is available as a slot-in replacement for any BERT-like model, with both 139M param and 395M param sizes
211)
Higgsfield AI launched ReelMagic - creates complete 10-min videos from a short story idea; writes script, choose actors, films, adds sound and music, does the editing; uses different AI programs for each part of the process and works with several other companies to make this happen.
RAG 2.0
212) built using n8n, Supabase, and Postgres DB; system monitors a Google Drive folder for new or updated files, automatically identifies the file type, extracts the text content, and stores it in a Supabase vector DB.
-
Meta's Large Concept Models: Language Modeling in a Sentence Representation Space. Trained to perform autoregressive sentence prediction in an embedding space.
213)
Chinese researchers may have worked out OpenAI's latest reasoning models - “Scaling of Search and Learning: A Roadmap to Reproduce o1 from the Perspective of Reinforcement Learning”
214)
OpenAI will transition its for-profit arm into a Delaware Public Benefit Corporation (PBC)
Storm, a free tool developed by the Stanford team, outperforms both Perplexity Pro and Google Deep Research in coding related research. But Storm can't engage in conversations and answer follow-up questions, but Google Deep Research is the bestoverall solution
215)
Google AgentSpace - early access
Run:ai - an Israeli software company was recently acquired by NVIDIA for $700 Mln. Run:ai has developed a platform to help organizations manage and optimize their AI workloads, especially those that rely heavily on GPUs
Cerebras Demonstrates Trillion Parameter Model Training on a Single CS-3 System (instead of thousands of GPUs).
Alibaba's Qwen QVQ-72B-Preview
QVQ excels at step-by-step reasoning through complex visual problems, particularly in mathematics and physics
ByteDance 1.58-bit FLUX - dramatically reduces the computational demands of state-of-the-art image generation while maintaining output quality. Instead of 8 bit values it uses 3 values (-1,0,+1). Reducing storage by 8x. Requires 5x less computer memory while producing faster generation speeds.
Sonus-1 - new model family from Rubik.ai
Decart.ai , a California/Israeli startup, has released “Oasis” - a real-time generative AI open-world video game (Oct 31, 2024)
Oasis can generate and render new content in real time, allowing users to explore and interact with AI-generated worlds with no predefined paths or goals, shaping the story and the world around them
DeepSeek-V3: “A New Era in Open-Source AI”; training only costed $6m instead of >$100m for LLama and > $1000m for GPT-4;
Deepseek Artifacts - free, open-source platform, and AI coder. Powered by DeepSeek V3. Generates apps in seconds!
Qwen QwQ 32B Preview - Reasoning model from China
B-STAR (Balanced Self-Taught Reasoning) AI Is Breaking All The Rules Of Self-improvement
216)
Generative AI Companies Secure Record $56 Billion in 2024