Research
-
A High-level Overview of Large Language ModelsResearch/NLP_YS2024 2024. 12. 1. 08:55
https://rbcborealis.com/research-blogs/a-high-level-overview-of-large-language-models/Jul, 12, 2023Since 2022, a series of AI systems have been introduced that enable machines to read, analyze, interpret, and derive meaning from human language. One such system is ChatGPT, which gained a over a hundred million users within a mere two months of its launch in November 2022. Its successor, GPT-4 was..
-
(3/3) An Introduction to Vision-Language ModelingResearch/Multimodal 2024. 11. 29. 23:44
https://arxiv.org/pdf/2405.172474. Approaches for Responsible VLM EvaluationAs the main ability of VLMs is to map text with images, it is crucial to measure visio-linguistic abilities so as to ensure that the words are actually mapping to visual clues. Early tasks used to evaluate VLMs were image captioning and Visual Question Answering (VQA) [Antol et al., 2015]. In this section, we also discus..
-
(2/3) An Introduction to Vision-Language ModelingResearch/Multimodal 2024. 11. 29. 11:36
https://arxiv.org/pdf/2405.172473. A Guide to VLM TrainingSeveral works [Henighan et al., 2020b,a] have shed light on the importance of scaling to push further the performances of deep neural networks. Motivated by these scaling laws, most recent works have focused on increasing compute and scale to learn better models. This led to a model like CLIP [Radford et al., 2021] which was trained on 40..
-
(1/3) An Introduction to Vision-Language ModelingResearch/Multimodal 2024. 11. 24. 21:42
https://arxiv.org/pdf/2405.17247AbstractFollowing the recent popularity of Large Language Models (LLMs), several attempts have been made to extend them to the visual domain. From having a visual assistant that could guide us through unfamiliar environments to generative models that produce images using only a high-level text description, the vision-language model (VLM) applications will signific..
-
[Attention Rollout] Explainability for Vision TransformersResearch/NLP_reference 2024. 11. 22. 09:16
https://jacobgil.github.io/deeplearning/vision-transformer-explainabilityhttps://github.com/jacobgil/vit-explainBackgroundIn the last few months before writing this post, there seems to be a sort of a breakthrough in bringing Transformers into the world of Computer Vision. To list a few notable works about this:An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale,Training d..
-
[Project Proposal] Improving the performance of machine-generated text (MGT) detection by identifying the significance of individual tokensResearch/NLP_Paper 2024. 11. 11. 14:49
※ 수정 중..!!※ This is the project proposal for Team 5 in the 2024 NLP class.※ The main idea for this project was provided by D.H. Lee.※ The content of this proposal is based on discussions with our team members: S.J.Kim, D.H.Lee, S.J.Lee, S.Y.Park.※ The final proposal PPT will be created in collaboration with S.J.Lee.※ The paper review presentation will be given by S.J.Kim.※ The proposal & project..
-
Causal Interpretation of Self-Attention in Pre-Trained TransformersResearch/NLP_Paper 2024. 11. 11. 10:38
https://arxiv.org/pdf/2310.20307(Oct 2023, NeurIPS) ※ 2024 NLP class team project's subjectAbstractWe propose a causal interpretation of self-attention in the Transformer neural network architecture. We interpret self-attention as a mechanism that estimates a structural equation model for a given input sequence of symbols (tokens). The structural equation model can be interpreted, in turn, as a ..
-
Enhancing Machine-Generated Text Detection: Adversarial Fine-Tuning of Pre-Trained Language ModelsResearch/NLP_Paper 2024. 11. 10. 22:17
※ 2024 NLP class team project's research subjectAbstractAdvances in large language models (LLMs) have revolutionized the natural language processing field. However, the text generated by LLMs can result in various issues, such as fake news, misinformation, and social media spam. In addition, detecting machine-generated text is becoming increasingly difficult because it produces text that resembl..