nsa@kbin.social · 2 years agoWhat's In My Big Data?plus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up12
arrow-up12external-linkWhat's In My Big Data?plus-squarearxiv.orgnsa@kbin.social · 2 years agomessage-square0fedilink
nsa@kbin.social · 2 years agoThe Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AIplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up12
arrow-up12external-linkThe Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AIplus-squarearxiv.orgnsa@kbin.social · 2 years agomessage-square0fedilink
KingsmanVince@kbin.social · 2 years agoMM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision and Language Models & Tasksplus-squareaclanthology.orgexternal-linkmessage-square0fedilinkarrow-up12
arrow-up12external-linkMM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision and Language Models & Tasksplus-squareaclanthology.orgKingsmanVince@kbin.social · 2 years agomessage-square0fedilink
KingsmanVince@kbin.social · 2 years agoDemystifying CLIP Dataplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up12
arrow-up12external-linkDemystifying CLIP Dataplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square0fedilink
nsa@kbin.social · 2 years agoGPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problemsplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up11
arrow-up11external-linkGPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problemsplus-squarearxiv.orgnsa@kbin.social · 2 years agomessage-square0fedilink
KingsmanVince@kbin.social · 2 years agoPaLI-3 Vision Language Models: Smaller, Faster, Strongerplus-squarearxiv.orgexternal-linkmessage-square3fedilinkarrow-up12
arrow-up12external-linkPaLI-3 Vision Language Models: Smaller, Faster, Strongerplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square3fedilink
KingsmanVince@kbin.social · 2 years agoMiniGPT-v2: large language model as a unified interface for vision-language multi-task learningplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up13
arrow-up13external-linkMiniGPT-v2: large language model as a unified interface for vision-language multi-task learningplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square0fedilink
KingsmanVince@kbin.social · 2 years agoFinetune Like You Pretrain: Improved Finetuning of Zero-Shot Vision Modelsplus-squareopenaccess.thecvf.comexternal-linkmessage-square0fedilinkarrow-up12
arrow-up12external-linkFinetune Like You Pretrain: Improved Finetuning of Zero-Shot Vision Modelsplus-squareopenaccess.thecvf.comKingsmanVince@kbin.social · 2 years agomessage-square0fedilink
nsa@kbin.social · 2 years agoA Long Way to Go: Investigating Length Correlations in RLHFplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up12
arrow-up12external-linkA Long Way to Go: Investigating Length Correlations in RLHFplus-squarearxiv.orgnsa@kbin.social · 2 years agomessage-square0fedilink
nsa@kbin.social · 2 years agoThink before you speak: Training Language Models With Pause Tokensplus-squarearxiv.orgexternal-linkmessage-square1fedilinkarrow-up13
arrow-up13external-linkThink before you speak: Training Language Models With Pause Tokensplus-squarearxiv.orgnsa@kbin.social · 2 years agomessage-square1fedilink
KingsmanVince@kbin.social · 2 years agoCLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say Noplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up13
arrow-up13external-linkCLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say Noplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square0fedilink
nsa@kbin.social · 2 years agoLanguage Modeling Is Compressionplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up12
arrow-up12external-linkLanguage Modeling Is Compressionplus-squarearxiv.orgnsa@kbin.social · 2 years agomessage-square0fedilink
KingsmanVince@kbin.social · 2 years agoScaling Vision-Language Models with Sparse Mixture of Expertsplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up13
arrow-up13external-linkScaling Vision-Language Models with Sparse Mixture of Expertsplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square0fedilink
KingsmanVince@kbin.social · 2 years agoHydra-MoE: A new class of Open-Source Mixture of Expertsplus-squaregithub.comexternal-linkmessage-square0fedilinkarrow-up14
arrow-up14external-linkHydra-MoE: A new class of Open-Source Mixture of Expertsplus-squaregithub.comKingsmanVince@kbin.social · 2 years agomessage-square0fedilink
KingsmanVince@kbin.social · 2 years agoBridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasksplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up13
arrow-up13external-linkBridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasksplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square0fedilink
KingsmanVince@kbin.social · 2 years agoFoundational Models Defining a New Era in Vision: A Survey and Outlookplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up13
arrow-up13external-linkFoundational Models Defining a New Era in Vision: A Survey and Outlookplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square0fedilink
KingsmanVince@kbin.social · 2 years agoUnifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Language Pre-trainingplus-squareaclanthology.orgexternal-linkmessage-square1fedilinkarrow-up15
arrow-up15external-linkUnifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Language Pre-trainingplus-squareaclanthology.orgKingsmanVince@kbin.social · 2 years agomessage-square1fedilink
KingsmanVince@kbin.social · 2 years agoMaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasksplus-squarearxiv.orgexternal-linkmessage-square1fedilinkarrow-up13
arrow-up13external-linkMaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasksplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square1fedilink
KingsmanVince@kbin.social · 2 years agoVision Language Transformers: A Surveyplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up13
arrow-up13external-linkVision Language Transformers: A Surveyplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square0fedilink
Lenguador@kbin.social · 2 years agoReal-Time Radiance Field Renderingplus-squarehuggingface.coexternal-linkmessage-square0fedilinkarrow-up11
arrow-up11external-linkReal-Time Radiance Field Renderingplus-squarehuggingface.coLenguador@kbin.social · 2 years agomessage-square0fedilink