Frozen AI News archive

GraphRAG: The Marriage of Knowledge Graphs and RAG

**Microsoft Research** open sourced **GraphRAG**, a retrieval augmented generation (RAG) technique that extracts knowledge graphs from sources and clusters them for improved LLM answers, though it increases token usage and inference time. **Gemma 2** models were released focusing on efficient small LLMs with innovations like sliding window attention and RMS norm, nearly matching the larger **Llama 3 70B**. **Anthropic's Claude 3.5 Sonnet** leads in instruction following and coding benchmarks, while **Nvidia's Nemotron 340B** model was released in June. **Qwen2-72B** tops the HuggingFace Open LLM leaderboard excelling in math and long-range reasoning. Discussions on RAG highlighted its limitations and improvements in context usage via function calls. A persona-driven synthetic data generation approach introduced 1 billion personas, with a fine-tuned model matching GPT-4 performance on math benchmarks at 7B scale. The **200GB AutoMathText dataset** was also noted for math data synthesis.

Canonical issue URL

AI News for 7/1/2024-7/2/2024. We checked 7 subreddits, 384 Twitters and 30 Discords (419 channels, and 2518 messages) for you. Estimated reading time saved (at 200wpm): 310 minutes. You can now tag @smol_ai for AINews discussions!

Neurosymbolic stans rejoice!

image.png

image.png

Microsoft Research first announced GraphRAG in April, and it was surprisingly popular during Neo4j's workshops and talks at the AI Engineer World's Fair last week (videos aren't yet live so we haven't seen it yet, but soon (tm)). They have now open sourced their code. As Travis Fischer puts it:

  1. use LLMs to extract a knowledge graph from your sources
  2. cluster this graph into communities of related entities at diff levels of detail
  3. for RAG, map over all communities to create "community answers" and reduce to create a final answer.

Or in their relatively less approachable words:

image.png

However, the dirty secret of all performance improvement techniques of this genre: token usage and inference time goes up 🙃

Also of note: their prompt rewriting approach


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3 Opus, best of 4 runs. We are working on clustering and flow engineering with Haiku.

LLM Model Releases and Improvements

Retrieval Augmented Generation (RAG) Techniques and Challenges

Synthetic Data Generation and Usage

Miscellaneous


AI Reddit Recap

Across r/LocalLlama, r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity. Comment crawling works now but has lots to improve!

LLM Development and Capabilities

Stable Diffusion Models and Training

Hardware and Performance

Optimizations and Benchmarks


AI Discord Recap

A summary of Summaries of Summaries

  1. LLM Performance and Benchmarking Advancements:

    • New models like Phi-3 Mini from Microsoft and Gemma 2 from Google are showing significant improvements in instruction following and performance.

    • The AI community is actively discussing and comparing model performances, with debates around benchmarks like AlignBench and MT-Bench.

    • There's growing interest in reproducible benchmarks, with efforts to replicate results using tools like lm_eval for models such as Gemma 2.

  2. Optimizing LLM Training and Inference:

  3. Open-Source AI Development and Community Collaboration:

  4. Multimodal AI and Generative Modeling:

    • Advancements in vision-language models are being discussed, with projects like Vistral 7B for Vietnamese and Florence-2 running locally with WebGPU.

    • Text-to-video generation is gaining traction, with tools like Runway's Gen-3 sparking discussions about capabilities and pricing.

    • The community is exploring combinations of models and techniques to achieve DALLE-3-level outputs, indicating a trend towards more sophisticated multimodal systems.


PART 1: High level Discord summaries

LM Studio Discord


HuggingFace Discord


CUDA MODE Discord


Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


Stability.ai (Stable Diffusion) Discord


OpenAI Discord


Eleuther Discord


Nous Research AI Discord


Modular (Mojo 🔥) Discord


OpenRouter (Alex Atallah) Discord


OpenAccess AI Collective (axolotl) Discord


LlamaIndex Discord


tinygrad (George Hotz) Discord


LangChain AI Discord


Latent Space Discord


Mozilla AI Discord


OpenInterpreter Discord


Torchtune Discord


Cohere Discord


LAION Discord


LLM Finetuning (Hamel + Dan) Discord


AI Stack Devs (Yoko Li) Discord


Interconnects (Nathan Lambert) Discord


Datasette - LLM (@SimonW) Discord


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

LM Studio ▷ #💬-general (193 messages🔥🔥):

LM Studio updates and issues, TTS integration in LM Studio, Model recommendations for local use, Challenges with large model loading, Gemma 2 model performance and updates

Links mentioned:


LM Studio ▷ #🤖-models-discussion-chat (148 messages🔥🔥):

Model performance and loading times, Phi-3 Mini update, Running models on different hardware, Excessive GPU idling temperature with LM Studio, Quantizing vision models for LM Studio

Links mentioned:


LM Studio ▷ #announcements (1 messages):

LM Studio 0.2.27 release, Improved support for Gemma 2 models, Bug fix for 'invalid creation parameter' issue, Advanced information on new updates, ROCm extension pack instructions for Windows

Links mentioned:


LM Studio ▷ #🧠-feedback (9 messages🔥):

Request for a channel in Norwegian, LM Studio update compatibility issues with AMD Radeon 7800 XT, Suggestions for handling GPU compatibility errors in LM Studio, ROCM handling changes in LM Studio 0.2.26, Vulkan backend support as a potential solution

Links mentioned:


LM Studio ▷ #🎛-hardware-discussion (47 messages🔥):

Transformers ASICs market potential, Sohu expected pricing, VRAM vs RAM for LLM inference, AMD GPU support issues in LM Studio, Enhanced GPU setups for LLM processing


LM Studio ▷ #autogen (2 messages):

LM Studio compatibility with group chat feature in autogen, Error handling with Llama 7b Instruct, Solutions for using LM Studio with multiple local models

Link mentioned: LM Studio | AutoGen: Open In Colab


LM Studio ▷ #amd-rocm-tech-preview (17 messages🔥):

ROCm extension performance, Code benchmark results, NVIDIA GPU for AI, Gemma 2 model issues, ROCm-specific GPUs, Linux ROCm extension pack testing


LM Studio ▷ #model-announcements (2 messages):

Phi 3 mini update to Phi 3.1 by Microsoft, Gemma 2 model updates for lmstudio community

Link mentioned: lmstudio-community/Phi-3.1-mini-4k-instruct-GGUF · Hugging Face: no description found


LM Studio ▷ #🛠-dev-chat (50 messages🔥):

LM Studio load command error after update, TokenInvalid error in Discord bot, Configuring LM StudioClient, Discord bot intents and permissions, Debugging and fixing Discord bot code

Links mentioned:


HuggingFace ▷ #announcements (1 messages):

Access tons of new fine-tunes for Transformers models with KerasNLP., Search Hugging Face datasets by column names with new API., Transformers 4.42 release with new models and features., Nearly 100k public models use the Hub to store tensorboard logs., Local Gemma announced for private and secure usage., AWS releases Chronos datasets on Hugging Face., Google releases high-quality Gemma 2 LLMs., Real-time Detection Transformer (RT-DETR) available in Hugging Face., Florence-2 runs locally with WebGPU in the browser., Intro to vision language models announced., New challenging LLM leaderboard released., Data Explorer video series by Argilla announced., Efficient PyTorch dataloaders for distributed training., New RAG with Gemma recipe using elastic search.

Links mentioned:


HuggingFace ▷ #general (318 messages🔥🔥):

Downloading Falcon40B and encountering issues, Comparison of Falcon 40B to other models like LLaMA 3, RAG (Retrieval-Augmented Generation) techniques and challenges, Parsing and managing large transcripts in LLMs, Summarization techniques for transcripts

Links mentioned:


HuggingFace ▷ #today-im-learning (2 messages):

diffusion models for learning, advanced CNN topics resources


HuggingFace ▷ #cool-finds (10 messages🔥):

AI figuring out meshes, Top 5 Python frameworks, AI and society paper, Running transformers on robots

Links mentioned:


HuggingFace ▷ #i-made-this (5 messages):

Vision-Language model for Vietnamese, AI and e-commerce projects, CriticGPT for code correction, Stable release of Embodied Agents toolkit

Links mentioned:


HuggingFace ▷ #reading-group (5 messages):

HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks, Terminator architecture, Terminator code repository, Fast training convergence with Terminator architecture

Link mentioned: GitHub - hyperevolnet/Terminator: The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.: The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction. - hyperevolnet/Terminator


HuggingFace ▷ #computer-vision (4 messages):

Resources for learning advanced CNN topics like ViT and Unets, Interest in working on computer vision tasks, Prompting etiquette reminders


HuggingFace ▷ #NLP (1 messages):

embedding numbers, specific embedding models


HuggingFace ▷ #diffusion-discussions (2 messages):

Problem with LLama.cpp causing access violation error, Seeking pre-trained model recommendation for fake voice detection


CUDA MODE ▷ #general (3 messages):

CUDA-only hackathon at the AGI House in San Francisco, Participants receive H100 access for the hackathon

Link mentioned: RSVP to Hardcore CUDA Hackathon | Partiful: All talks and projects MUST be written in CUDA Every hardcore hacker gets a H100 for the day. All sponsored and proved by Nebius.ai! Let's blow away some baselines.


CUDA MODE ▷ #cool-links (1 messages):

mobicham: https://salykova.github.io/matmul-cpu


CUDA MODE ▷ #torchao (30 messages🔥):

INTx performance benchmarking, Quantization techniques in torchao, INT8-weight only vs INTx-4 implementation, Evaluation script batch sizes, Token per second trends with different INTx implementations

Link mentioned: executorch/examples/models/llama2 at main · pytorch/executorch: On-device AI across mobile, embedded and edge for PyTorch - pytorch/executorch


CUDA MODE ▷ #hqq (5 messages):

Benchmark results script request, Token/second calculation code request, Transformer update issues

Links mentioned:


CUDA MODE ▷ #llmdotc (204 messages🔥🔥):

Improving helper and kernel functions in CUDA MODE, Discussion on GenericVector and its applications, Training stability with different datasets and model sizes, Streamlining the setup and memory usage optimization for llm.c, Investigating performance impacts with various configurations and settings, Exploring inference optimizations and platform simplifications

Links mentioned:


CUDA MODE ▷ #rocm (1 messages):

Performance comparison of AMD MI300X and Nvidia H100 SXM on Mixtral 8x7B inference, Advantages of Nvidia's CUDA over AMD's ROCm for AI workloads, Developer preferences for Nvidia GPUs in AI production

Link mentioned: AMD MI300X vs. Nvidia H100 SXM: Performance Comparison on Mixtral 8x7B Inference: There’s no denying Nvidia's historical dominance when it comes to AI training and inference. Nearly all production AI workloads run on their graphics cards. However, there’s been some optimism r...


CUDA MODE ▷ #sparsity (1 messages):

iss_llm: Thank you very much <@1213148470664495114> !


Perplexity AI ▷ #announcements (2 messages):

Voice-to-voice experience on Android app, Pro Search update with deeper research capabilities

The improved search delivers more informed answers, supporting complex and detailed inquiries. More information can be found here.


Perplexity AI ▷ #general (211 messages🔥🔥):

Perplexity AI search engine issues and updates, Discussion on Claude's token limitations on Perplexity AI, Methods to generate graphs or visualizations using Perplexity, Referral links and promotional codes for Perplexity AI, Perplexity AI's bias towards Indian news sites, Usage of AI models and search results accuracy, General queries about Perplexity AI's mobile app features

Links mentioned:


Perplexity AI ▷ #sharing (9 messages🔥):

Best use cases for Perplexity AI, Rubik's Cube 50th anniversary, EU charges Meta, Electric flights development, AI video advances, Starting a business with Perplexity, Emotional impact of tears, Microsoft Connect Test search

Link mentioned: YouTube: no description found


Perplexity AI ▷ #pplx-api (8 messages🔥):

API settings loading issue, Perplexity citations request follow-up, Sonnet 3.5 and Perplexity API usage, API model availability, Search engine via API

Link mentioned: Supported Models: no description found


Unsloth AI (Daniel Han) ▷ #general (130 messages🔥🔥):

Scaling synthetic data creation, Issues with Unsloth on multi-GPU setups, Unsloth's compatibility with Ollama, Unsloth's new features and updates, Use of RAG (Retrieval-Augmented Generation) with Unsloth

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (6 messages):

Microsoft updated Phi-3 Mini, New recipe training suspicion, Antropic's steering stuff and Sonnet 3.5, OpenAI develops CriticGPT

Link mentioned: Reddit - Dive into anything: no description found


Unsloth AI (Daniel Han) ▷ #help (44 messages🔥):

SPPO support for Unsloth, Common errors with xformers and PyTorch, Fine-tuning models with Unsloth and deploying on Ollama, Running Unsloth models on AMD GPUs, Handling Out of Memory (OOM) errors during model training

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (2 messages):

llmcord.py, Discord LLM frontend, jakobdylanc's GitHub project, community feedback on llmcord.py

Link mentioned: GitHub - jakobdylanc/discord-llm-chatbot: llmcord.py • Talk to LLMs with your friends!: llmcord.py • Talk to LLMs with your friends! Contribute to jakobdylanc/discord-llm-chatbot development by creating an account on GitHub.


Unsloth AI (Daniel Han) ▷ #community-collaboration (2 messages):

collaborative tutorial and notebook release, dataset support enhancements, finetuning guide available, requests for feedback on colab notebook

Link mentioned: Google Colab: no description found


Unsloth AI (Daniel Han) ▷ #research (11 messages🔥):

Scaling Synthetic Data Creation with 1,000,000,000 Personas, Persona-driven data synthesis methodology, Google released Gemma 2, Generalized Knowledge Distillation (GKD) by Google DeepMind

Links mentioned:


Stability.ai (Stable Diffusion) ▷ #general-chat (155 messages🔥🔥):

Running Stable Diffusion with limited VRAM, Issues with Stable Diffusion 3, Training LoRA (Low-Rank Adaptation) for specific styles, Stable Diffusion for different image styles, Anti-AI art software potential

Links mentioned:


OpenAI ▷ #ai-discussions (118 messages🔥🔥):

High cost and performance of AI hardware (H100 and H200 GPUs, Google TPUs), Disappointment with AI image generation tools (Luma Dream Machine, Runway Gen-3), Potential of using TPU V3-8 and Paperspace for AI training, Multimodal models and their limitations in specific domains, Orchestrators like LangChain and context limits in AI


OpenAI ▷ #prompt-engineering (3 messages):

Ensuring GPT completes all steps in task, Prompt design for intention checking in RAG on GPT-4o, Issues with GPT skipping steps in multi-step prompt


OpenAI ▷ #api-discussions (3 messages):

Customer-GPT checking itself after task completion, Intention checking prompt for RAG on GPT-4, Structuring GPT instructions to avoid skipping steps


Eleuther ▷ #general (20 messages🔥):

vLLM Deployment, Probing Learnt Representations in Diffusion Models, GPT-4 Parameter Count and Mixture of Experts

Links mentioned:


Eleuther ▷ #research (73 messages🔥🔥):

Tokenization in LLMs, Factorization Curse in Language Models, Covert Malicious Fine-Tuning, Memory Efficiency in PEFT Methods, Estimating Networking Hardware Costs for ML Clusters

Links mentioned:


Eleuther ▷ #lm-thunderdome (23 messages🔥):

Issues reproducing Gemma 2 metrics with lm_eval, `Possible fixes for Gemma 2 metric reproduction issues`, `Visualization of fewshot prompting and its impact on accuracy`, `Clarification on the evaluation framework and scoring mechanism in `lm_eval, Integration of OpenAI's evalslibrary withlm_eval``

Links mentioned:


Nous Research AI ▷ #research-papers (3 messages):

Apple's variable bit quantization in on-device LLM, Talaria tool for model visualization and optimization by Apple, Introduction of Apple Foundation Models at WWDC 2024, Introducing Terminator architecture without residuals, dot product attention, or normalization

Links mentioned:


Nous Research AI ▷ #datasets (1 messages):

PersonaHub introduction and potential use cases, Key considerations for scheduling and logistics for multi-show festivals, Distribution and organization of public services in Halifax, Application of synthetic data in LLM research and development

Link mentioned: proj-persona/PersonaHub · Datasets at Hugging Face: no description found


Nous Research AI ▷ #off-topic (2 messages):

MatrixBridge AI demo for NotDevin, Hackathon in Italy by Yaya Labs

Links mentioned:


Nous Research AI ▷ #interesting-links (1 messages):

harvie_zhang: https://x.com/leopolisdream/status/1804627325583327358?s=46&t=BsqYoGA8vIHGcXwORlMk7w


Nous Research AI ▷ #general (82 messages🔥🔥):

Animating with Vision Capable Language Models (VLLMs), Runway's High-Fidelity Video Generation, Cost Concerns with Runway's AI Tools, New techniques for VLLMs and multimodal models, Custom quantization schemes for Llama models

Links mentioned:


Nous Research AI ▷ #ask-about-llms (3 messages):

Creating conversational dataset or instructions dataset from documents, Open-source tools for generating datasets, Anthropic solutions for dataset generation with sufficient budget, Genstruct 7B model for instruction generation from raw text corpus, Inspired by Ada-Instruct model for instruction generation

Link mentioned: NousResearch/Genstruct-7B · Hugging Face: no description found


Nous Research AI ▷ #world-sim (4 messages):

Sharpening bots into question/answer bots linking to Nous, Fixed issue by apyh, User appreciation for AI scenarios, Teknium's future possibilities for bots


Modular (Mojo 🔥) ▷ #general (5 messages):

Running Mojo on Ubuntu 24.04 on Raspberry Pi 5, Link to AI engineer world fair talk, Importing .mojopkg from a subdirectory

Link mentioned: AI Engineer World’s Fair 2024 - Keynotes & Multimodality track: https://twitter.com/aidotengineer1. Opening music - 00:002. Announcement - 03:263. AI Engineer Summit Opening Remarks - 17:124. Benjamin Presentation - 17:22...


Modular (Mojo 🔥) ▷ #💬︱twitter (1 messages):

ModularBot: From Modular: https://twitter.com/Modular/status/1808228006068212110


Modular (Mojo 🔥) ▷ #ai (2 messages):

Mojo promising areas, Model and simulator-based RL for LLM agents, Symbolic reasoning and inference time search, Sub-symbolic model steering, Constrained inference


Modular (Mojo 🔥) ▷ #🔥mojo (27 messages🔥):

Mojo lifetime parameters, Using MutableLifetime and AnyLifetime in Mojo, Package manager necessity for Mojo, Difference between parallelize and sync_parallelize in Mojo, Working with List in Mojo and printing its elements

Links mentioned:


Modular (Mojo 🔥) ▷ #nightly (7 messages):

Mojo compiler nightly update, Discussion on stdlib contributors and Mojo merch, Issues with nightly/max package updates, CI transitions and resulting issues


Modular (Mojo 🔥) ▷ #mojo-marathons (44 messages🔥):

Discussion about src/main compilation issues, Benchmarking failing due to rounding errors, System specs provided for troubleshooting, Benchmarking improvements discussion, Tests and benchmarking framework updated, Discussion on data types for matrix multiplication, Brainstorming about improving matrix multiplication algorithms

Links mentioned:


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

Big update to the /models page, Changing Google Token Sizes for Gemini and PaLM models, Deprecation of Default Model setting, Deprecation of custom auth headers for OpenAI API keys


OpenRouter (Alex Atallah) ▷ #app-showcase (1 messages):

lastrosade: I made a quick and dirty wrapper if anyone wants it.


OpenRouter (Alex Atallah) ▷ #general (68 messages🔥🔥):

Mistral API error handling, Differences between LiteLLM and OpenRouter, Improving efficiency in conversation bots, Claude 3.5 intermittent errors, Frontend Apps for OpenRouter on iOS

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #general (49 messages🔥):

Instruction-tuned models discussion, Downsides of continuing training IT over base model, Plans to add CAME or Adam-mini optimizers to axolotl, High grad_norm values while fine-tuning gemma 27b, Prioritizing accuracy in numerical answers over explanatory text

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #general-help (3 messages):

Error message investigation, Configuration issues


OpenAccess AI Collective (axolotl) ▷ #axolotl-help-bot (13 messages🔥):

Training Gemma2 models with eager attention, Modifying YAML for Gemma2 eager attention, Does batch size increase with more GPUs

Links mentioned:


LlamaIndex ▷ #blog (2 messages):

Translation of Python multi-agent systems into microservices, Comprehensive video tutorial on 'llama-agents' framework, Building better knowledge assistants beyond naive RAG, Components of advanced data and retrieval modules for knowledge assistants


LlamaIndex ▷ #general (43 messages🔥):

sub-agents tutorials and applications, new agents release, integrating custom message queues, RAG-based chatbots for company data, conversation history in LlamaIndex, issues with building llama-cpp-python, Microsoft’s graph RAG architecture, DocumentSummaryIndex metadata issues with Pinecone

Links mentioned:


tinygrad (George Hotz) ▷ #general (9 messages🔥):

graph rewrite followup and speedup/different algorithm, special dtype 'image dtype', error messages and dev tooling in Tinygrad, PR with a failing test and minimal repro


tinygrad (George Hotz) ▷ #learn-tinygrad (34 messages🔥):

JIT handling of zero_grad in Tinygrad, Memory issues with gradient accumulation in Tinygrad, Equivalent of torch.no_grad() in Tinygrad, Gradient handling and parameter updates in Tinygrad, Improving documentation and examples for Tinygrad

Links mentioned:


LangChain AI ▷ #general (21 messages🔥):

Best RAG strategy: HydeRetrieval vs. MultiQueryRetrieval, LangChain usage for acknowledging messages, Copilot image storage in Edge, Sharding of embedding databases

Link mentioned: Issues · langchain-ai/langchain: 🦜🔗 Build context-aware reasoning applications. Contribute to langchain-ai/langchain development by creating an account on GitHub.


LangChain AI ▷ #langserve (3 messages):

Using FastAPI-Users with LangServe, Allowing file uploads in a LangChain project, Debugging output display issues in CSV playground


LangChain AI ▷ #langchain-templates (12 messages🔥):

Creating a chatbot using LangChain and LangChain Expression Language, Adding action capabilities like scheduling demos and connecting users to sales teams, Defining actions for a chatbot in LangChain, Creating and using Agents in LangChain, Debugging and tracing applications with LangSmith, Providing Python code for LangChain chatbot capabilities


LangChain AI ▷ #share-your-work (4 messages):

Building chatbots using LLM, RAFT pipeline development, Comparison of RAFT and traditional RAG, OpenAI's CriticGPT, Experience sharing and collaboration offers

Links mentioned:


Latent Space ▷ #ai-general-chat (36 messages🔥):

Runway Gen 3 release, Sonnet + Artifacts usage, Anthropic principles of good evals, Chatbot arena ranking issues, Goldman Sachs report on AI investments, Figma AI feature concerns, Best TTS models, Claude 3.5 Sonnet update, Updated Phi-3 Mini by Microsoft, Magic Dev's valuation increase

Links mentioned:


Mozilla AI ▷ #llamafile (35 messages🔥):

Hardware requirements for running llama.cpp, Release of llamafile v0.8.9, Testing mxbai-embed-large-v1 model, Choosing hardware for running large language models, CPU vs GPU for AI model training and inference

Links mentioned:


OpenInterpreter ▷ #general (25 messages🔥):

Building 01 on Windows, Docker capabilities with OI, Concurrency and resource isolation in OI deployments, Handling image displays in OI, Local AI agents for web browsing

Link mentioned: When I use interpreter.chat(stream=True), in what scenarios will type return 'image'? · Issue #1301 · OpenInterpreter/open-interpreter: Describe the bug When I use interpreter.chat(stream=True), in what scenarios will type return 'image'? When I try to use it in version 0.1.18, it returns image, but version 0.2.5 does not like...


OpenInterpreter ▷ #O1 (2 messages):

Portaudio installation issue on Windows 11, Pull request for updating Windows installation documentation

Link mentioned: Update documentation for Windows installation by dheavy · Pull Request #203 · OpenInterpreter/01: Problem Installation for Windows, with its key differences, isn't provided in the documentation. Solution Compile learnings from previous users' attempt (including Zorcon's on Discord and ...


Torchtune ▷ #general (26 messages🔥):

Finetuning models on GPUs and logging with WandB, Training configurations and evaluation methods in torchtune, Challenges of using AMD GPUs for AI tasks, Guidance on DPO without SFT for small datasets, Converting torchtune models to HuggingFace

Links mentioned:


Cohere ▷ #general (17 messages🔥):

Multi-step capabilities of the toolkit, Events and sessions at AI Engineer in San Francisco, Commad R models session by Sandra Kublik, LLM-UNIVERSITY channel sunsetting and support, Using Cohere's API and resources for development

Links mentioned:


Cohere ▷ #project-sharing (3 messages):

Cohere Slack bot creation, Performance requirements for Slack bots, Speed of model processing


Cohere ▷ #announcements (1 messages):

Cohere For AI event in London, Expedition Aya initiative, Benefits and activities of Expedition Aya, Multilingual AI research, Crew Connections meetings

Links mentioned:


LAION ▷ #general (2 messages):

Reminder to upgrade openssh packages, Figure 1 doing full end-to-end BMW use cases with pixel-to-action neural networks

Link mentioned: Tweet from Corey Lynch (@coreylynch): Figure 1 is now doing full end-to-end BMW use cases, with all manipulations learned as 200hz pixel-to-action neural networks. Learned behaviors need to be incredibly precise (<1cm sheet metal ins...


LAION ▷ #research (15 messages🔥):

ML model evaluation complexity, Novel LLM: phi-CTNL, Correct solution validation for AIW+ problem, Clarification on problem-solving assumptions, Introduction of new model architecture: Terminator

Links mentioned:


LLM Finetuning (Hamel + Dan) ▷ #general (7 messages):

Knowledge graphs and Lang Graph in AI projects, Voice detection and transcription models, Recording of latest talks, Lessons from a year of building with LLMs

Link mentioned: cookbook/audio-assistant at main · Chainlit/cookbook: Chainlit's cookbook repo. Contribute to Chainlit/cookbook development by creating an account on GitHub.


LLM Finetuning (Hamel + Dan) ▷ #🟩-modal (1 messages):

Processing large datasets from Kaggle competitions, Using Dask for data processing, Out of Memory (OOM) errors in Dask, Executing Dask jobs on Modal


LLM Finetuning (Hamel + Dan) ▷ #paige_when_finetune (1 messages):

shamik_53759: Yep, it's up now. Thanks!


LLM Finetuning (Hamel + Dan) ▷ #axolotl (2 messages):

Autotrainer suggestion, Clarification on Autotrainer


LLM Finetuning (Hamel + Dan) ▷ #openai (1 messages):

OpenAI credit consumption feedback


LLM Finetuning (Hamel + Dan) ▷ #bergum_rag (4 messages):

Location of slide deck in the video, Jo's slide deck request


AI Stack Devs (Yoko Li) ▷ #app-showcase (1 messages):

mikhail_ee: Some fresh locations from https://Hexagen.World


AI Stack Devs (Yoko Li) ▷ #ai-town-discuss (5 messages):

Request for Docker port, PR for Docker port suggested, GitHub page for AI Town setup on Windows using WSL shared

Link mentioned: GitHub - Ikkitsuna/AI-Town-Windows-Setup-WSL-method: Guide for setting up AI Town on Windows using WSL: Guide for setting up AI Town on Windows using WSL. Contribute to Ikkitsuna/AI-Town-Windows-Setup-WSL-method development by creating an account on GitHub.


Interconnects (Nathan Lambert) ▷ #news (5 messages):

Apple's observer seat at OpenAI, Phil Schiller joining OpenAI board as observer, Microsoft's investment vs. Apple's exposure deal, Reactions to Apple's deal with OpenAI, Comparisons between Apple's and Microsoft's deals with OpenAI

Links mentioned:


Datasette - LLM (@SimonW) ▷ #ai (2 messages):

Discussion on the evolution of internet browsing on mobile phones., American federal officials receiving foreign gifts., Historical context of early mobile internet services., Foreign gifts as data and their recording issues.

Links mentioned:






{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}