MLA2024 | Ziuch の Blog

type

status

date

slug

summary

tags

category

icon

password

Last edited time

Nov 7, 2024 01:21 PM

😀

MLA2024官方版下载丨最新版下载丨绿色版下载丨APP下载-123云盘

123云盘为您提供MLA2024最新版正式版官方版绿色版下载,MLA2024安卓版手机版apk免费下载安装到手机,支持电脑端一键快捷安装

MLA2024官方版下载丨最新版下载丨绿色版下载丨APP下载-123云盘

https://www.123865.com/s/wuRRVv-3w243

https://www.yipai360.com/photolivepc/?orderId=202409252014036913

MLA2024会议程序册.pdf

https://mla2024.bdaa.pro/

📝 主旨内容

开幕式

notion image

notion image

报告

Exploring the New Frontiers of AI – ByteDance Research's Exploration

notion image

notion image

字节跳动AI实验室:

Robotic

AI for Science

Responsible AI

AI Foundation: Large AI Models

蛋白质建模与设计 —— CryoFM，DPLM，DPLM2

💡

DPLM-2是一种多模态蛋白模型，通过联合序列和结构生成，提高了蛋白质建模效率和精度

bytedance • Updated Nov 27, 2024

DPLM-2: A Multimodal Diffusion Protein Language Model

Proteins are essential macromolecules defined by their amino acid sequences, which determine their three-dimensional structures and, consequently, their functions in all living organisms....

https://arxiv.org/abs/2410.13782

机器人 —— GR-1，GR-2

💡

GR-2通过视频生成预训练和机器人数据微调，实现多视角条件下的视觉操控

bytedance • Updated Nov 26, 2024

gr2-manipulation.github.io

https://gr2-manipulation.github.io/

notion image

端到端同声传译 —— CLASI(Cross Language Agent – Simultaneous Interpretation)

💡

通过处理当前音频输入，结合外部知识检索和历史上下文信息，实时生成高质量的翻译。

byteresearchcla.github.io

https://byteresearchcla.github.io/clasi/

notion image

notion image

视频生成 —— PixelDance

openaccess.thecvf.com

https://openaccess.thecvf.com/content/CVPR2024/papers/Zeng_Make_Pixels_Dance_High-Dynamic_Video_Generation_CVPR_2024_paper.pdf

Make Pixels Dance: High-Dynamic Video Generation

Make Pixels Dance: High-Dynamic Video Generation

https://makepixelsdance.github.io/

notion image

notion image

Dreamina: Free AI Image Generator - Create Art & Images from Text

Create stunning art, images and more with prompts. Turn your images into captivating animations. Dreamina is an AI platform designed to simplify your creation.

Dreamina: Free AI Image Generator - Create Art & Images from Text

https://dreamina.capcut.com/

Dreamina: Free AI Image Generator - Create Art & Images from Text

notion image

细粒度多模态场景理解与生成

notion image

基于大模型的神经符号计算

notion image

大模型检索增强

notion image

notion image

notion image

报告内容

通用文本表征特征

学习索引

RAG

notion image

Lighter And Better: Towards Flexible Context Adaptation For...

The existing Retrieval-Augmented Generation (RAG) systems face significant challenges in terms of cost and effectiveness. On one hand, they need to encode the lengthy retrieved contexts before...

https://arxiv.org/abs/2409.15699

notion image

💡

FlexRAG通过压缩上下文嵌入，提升生成质量并降低成本，实现灵活高效的RAG系统

海报

Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection

💡

提出MCM方法，将OOD检测从单模态扩展到多模态，显著提升检测性能

ECCV2024: Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection

Companion talk of ECCV2024 paper: Zihan Zhang*, Zhuo Xu*, Xiang Xiang* Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection. In ECCV, 2024. Abstract: Out-of-distribution (OOD) detection is a significant challenge in deploying pattern recognition and machine learning models, as models often fail on data from novel distributions. Recent vision-language models (VLMs) such as CLIP have shown promise in OOD detection through their generalizable multimodal representations. Existing CLIP-based OOD detection methods only utilize a single modality of in-distribution (ID) information (\eg, textual cues). However, we find that the ID visual information helps to leverage CLIP's full potential for OOD detection. In this paper, we pursue a different approach and explore the regime to leverage both the visual and textual ID information. Specifically, we propose Dual-Pattern Matching (DPM), efficiently adapting CLIP for OOD detection by leveraging both textual and visual ID patterns. DPM stores ID class-wise text features as the textual pattern and the aggregated ID visual information as the visual pattern. At test time, the similarity to both patterns is computed to detect OOD inputs. We further extend DPM with lightweight adaptation for enhanced OOD detection. Experiments demonstrate DPM's advantages, outperforming existing methods on common benchmarks. The dual-pattern approach provides a simple yet effective way to exploit multi-modality for OOD detection with vision-language representations.

ECCV2024: Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection

https://www.youtube.com/watch?v=XuKXwbdLS9I

ECCV2024: Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection

顶会回顾

ICLR——北京大学袁粒

拒稿

转投CVPR oral

notion image

PKU-YUAN-Lab (袁粒课题组-北大信工)

Open codes from YUAN Lab at PKU. PKU-YUAN-Lab (袁粒课题组-北大信工) has 21 repositories available. Follow their code on GitHub.

PKU-YUAN-Lab (袁粒课题组-北大信工)

https://github.com/PKU-YuanGroup

PKU-YUAN-Lab (袁粒课题组-北大信工)

notion image

ECCV2024——重庆大学

3D视觉

复杂分割华为火花奖

🤗 总结归纳

📎 参考文章

CLASI ：字节跳动开发的端到端语音同步翻译系统模拟专业的人类翻译

CLASI是由字节跳动开发的一个高质量的

CLASI ：字节跳动开发的端到端语音同步翻译系统模拟专业的人类翻译

https://xiaohu.ai/p/11898

CLASI ：字节跳动开发的端到端语音同步翻译系统模拟专业的人类翻译

作者:ziuch
链接:https://ziuch.com/article/MLA2024
声明:本文采用 CC BY-NC-SA 4.0 许可协议，转载请注明出处。

机场非合作目标跑道入侵检测demo MPPC——多重配对像素一致性解决瑕疵缺陷检测

Loading...

ziuch

ziuch

一个普通的干饭人🍚

最新发布

多光谱和视频数据预测身体指标

Hawk: 学习理解开放世界视频异常(NeurIPS 2024 Poster)

ICASSP Rebuttal

基于矩阵乘法对GPU烤机

CFM的踩坑指南

M3DM-pointnet2_ops踩坑指北

公告