Transformer-based large language models (LLMs) have shown excellent performance across a wide range of tasks, especially when fine-tuned for specific domains. Recent studies have shown that the resources required for fine-tuning LLMs can be reduced by parameter-efficient methods such as low-rank adaptation (LoRA). While LoRA effectively lowers the computational burden and resource requirements, it currently supports only single-job fine-tuning. In this paper, we introduce ASPEN, a high-throughput framework for fine-tuning LLMs. ASPEN uses the LoRA approach to efficiently train multiple jobs on a single GPU by sharing the frozen pre-trained model among jobs and scheduling them adaptively. ASPEN is compatible with transformer-based language models such as LLaMA and ChatGLM. Experiments show that ASPEN saves 53% of GPU memory when training multiple LLaMA-7B models on an NVIDIA A100 80GB GPU, and improves training throughput by about 17% compared to existing methods when training with various pre-trained models on different GPUs. ASPEN's adaptive scheduling algorithm, which prioritizes jobs and prevents out-of-memory issues, improves turnaround time by 24% and reduces end-to-end training latency by 12%.
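The memory savings from sharing one frozen base model across jobs follow directly from LoRA's formulation: each job trains only a pair of small low-rank factors while the large pre-trained weight stays fixed and shared. The following is a minimal sketch of that idea in plain Python; the function names, matrix shapes, and values are illustrative assumptions, not ASPEN's actual implementation.

```python
# Hypothetical sketch of a LoRA-adapted linear layer.
# W (d_out x d_in) is the frozen pre-trained weight, shared by all jobs;
# each job owns only its low-rank factors A (r x d_in) and B (d_out x r),
# so many jobs can fine-tune on one GPU holding a single copy of W.

def matmul(X, Y):
    """Plain-Python matrix multiply, kept dependency-free for the sketch."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)]
            for row in X]

def transpose(M):
    return [list(col) for col in zip(*M)]

def lora_forward(x, W, A, B, alpha, r):
    """y = x W^T + (alpha / r) * (x A^T) B^T  -- LoRA's adapted output."""
    base = matmul(x, transpose(W))          # frozen path, shared weight
    low_rank = matmul(matmul(x, transpose(A)), transpose(B))
    scale = alpha / r                       # standard LoRA scaling
    return [[b + scale * l for b, l in zip(br, lr)]
            for br, lr in zip(base, low_rank)]
```

Because B is conventionally initialized to zeros, a freshly added adapter leaves the base model's output unchanged; training then moves only A and B, which for rank r much smaller than the hidden size is a tiny fraction of the full weight's parameters.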