gEEkstr33t @jamalir - Tumblr Blog

Paper page - Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Join the discussion on this paper page

#machine learning #open source #ml #deep learning #multi modal #pre training

rasbt/llama-3.2-from-scratch · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

#machine learning #open source #deep learning #ml #llm #hugging face #llama #instructions

#machine learning #deep learning #llm #gemma3 #hugging face #training #qlora #transformers #fine tuning #guide

https://x.com/llamafactory_ai/status/1893879214727991504?t=0rz_iG3YO_ppFRatiDqh0A&s=09

#llama #deep learning #machine learning #llm #vlm #training #fine tuning

Meta AI Releases the Video Joint Embedding Predictive Architecture (V-JEPA) Model: A Crucial Step in Advancing Machine Intelligence - MarkTechPost

#machine learning #deep learning #meta #video #self supervised learning #video understanding #motion predictions

Paper page - Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

#machine learning #deep learning #vlm #text understanding

SmolVLM2: Bringing Video Understanding to Every Device

#machine learning #deep learning #ml #vlm #video language models #video understanding

Magma: A Foundation Model for Multimodal AI Agents

#mllm #magma #machine learning #deep learning #open source #microsoft #video understanding

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

A Blog post by Daniel Voigt Godoy on Hugging Face

#machine learning #deep learning #ml #open source #hugging face #fine tuning #training #llm #guides

TGI Multi-LoRA: Deploy Once, Serve 30 Models

#machine learning #deep learning #guides #lora #fine tuning #hugging face

Paper page - DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

#rag #llm #knowledge retrieval #deep learning #ml

Qwen2.5 VL! Qwen2.5 VL! Qwen2.5 VL! | Qwen

We release Qwen2.5-VL, the new flagship vision-language model of Qwen and also a significan...

#llm #multimodal #deep learning #qwen #vision #document understanding

Open-R1: a fully open reproduction of DeepSeek-R1

#deepseek #hugging face #report #deep learning #reinforcement learning #llm

GitHub - DAMO-NLP-SG/VideoLLaMA3: Frontier Multimodal Foundation Models for Image and Video Understanding

#vision transformers #deep learning #machine learning #ml #video language models #video summarization #videollama3

Paper page - MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

#machine learning #deep learning #ml #transformers #hugging face #speech to text #text to speech #multi modal #llm

Paper page - 1.58-bit FLUX

#machine learning #deep learning #vision transformers #flux #open source #transformers #computer vision

Apollo: An Exploration of Video Understanding in Large Multimodal Models

#ml #machine learning #deep learning #transformers #multi modal #video language models #video summarization #video understand
