Commit 22c6415
committed
refactor: Use MergedColumnParallelLinear for Whisper cross-attention kv_proj
Address maintainer feedback:
- Replace QKVParallelLinear with MergedColumnParallelLinear for kv_proj
in WhisperCrossAttention, enabling LoRA support via existing
MergedColumnParallelLinearWithLoRA infrastructure
- Update weight loading to use integer shard indices (0, 1) instead of
string identifiers ("k", "v") for MergedColumnParallelLinear
- Remove redundant embedding_modules and embedding_padding_modules
attributes from WhisperForConditionalGeneration
- Remove example file (similar to existing multilora_inference.py)
- Rollback LoRA layer changes as they are no longer needed
- Update tests to reflect new architecture
Signed-off-by: daje0601 <englishmt4118@gmail.com>1 parent ba3826b commit 22c6415
File tree
4 files changed
+55
-222
lines changed- examples/offline_inference
- tests/lora
- vllm
- lora/layers
- model_executor/models
4 files changed
+55
-222
lines changedLines changed: 0 additions & 136 deletions
This file was deleted.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
5 | 5 | | |
6 | 6 | | |
7 | 7 | | |
8 | | - | |
| 8 | + | |
9 | 9 | | |
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
13 | 13 | | |
14 | 14 | | |
15 | 15 | | |
16 | | - | |
| 16 | + | |
17 | 17 | | |
18 | | - | |
| 18 | + | |
19 | 19 | | |
20 | 20 | | |
21 | 21 | | |
| |||
36 | 36 | | |
37 | 37 | | |
38 | 38 | | |
39 | | - | |
40 | | - | |
41 | | - | |
42 | | - | |
43 | | - | |
44 | | - | |
45 | | - | |
46 | | - | |
47 | | - | |
48 | | - | |
49 | | - | |
50 | | - | |
51 | 39 | | |
52 | 40 | | |
53 | 41 | | |
| |||
63 | 51 | | |
64 | 52 | | |
65 | 53 | | |
66 | | - | |
67 | | - | |
| 54 | + | |
| 55 | + | |
68 | 56 | | |
69 | 57 | | |
70 | | - | |
| 58 | + | |
71 | 59 | | |
72 | 60 | | |
73 | | - | |
74 | | - | |
75 | | - | |
76 | | - | |
77 | | - | |
78 | | - | |
79 | | - | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
80 | 66 | | |
81 | 67 | | |
82 | 68 | | |
| |||
88 | 74 | | |
89 | 75 | | |
90 | 76 | | |
91 | | - | |
| 77 | + | |
92 | 78 | | |
93 | | - | |
| 79 | + | |
94 | 80 | | |
95 | 81 | | |
96 | 82 | | |
97 | 83 | | |
98 | 84 | | |
99 | | - | |
100 | | - | |
101 | | - | |
102 | | - | |
103 | | - | |
104 | | - | |
105 | | - | |
106 | | - | |
107 | | - | |
108 | | - | |
109 | | - | |
| 85 | + | |
110 | 86 | | |
111 | | - | |
112 | | - | |
113 | | - | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
114 | 90 | | |
115 | 91 | | |
116 | 92 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
356 | 356 | | |
357 | 357 | | |
358 | 358 | | |
| 359 | + | |
| 360 | + | |
359 | 361 | | |
360 | 362 | | |
361 | 363 | | |
| |||
364 | 366 | | |
365 | 367 | | |
366 | 368 | | |
367 | | - | |
368 | | - | |
369 | | - | |
370 | | - | |
371 | | - | |
372 | | - | |
373 | | - | |
374 | | - | |
375 | | - | |
376 | | - | |
377 | | - | |
378 | | - | |
379 | | - | |
380 | | - | |
381 | | - | |
382 | | - | |
383 | | - | |
| 369 | + | |
| 370 | + | |
| 371 | + | |
| 372 | + | |
| 373 | + | |
| 374 | + | |
| 375 | + | |
| 376 | + | |
| 377 | + | |
| 378 | + | |
384 | 379 | | |
385 | 380 | | |
386 | 381 | | |
| |||
403 | 398 | | |
404 | 399 | | |
405 | 400 | | |
406 | | - | |
407 | | - | |
408 | | - | |
409 | | - | |
410 | | - | |
| 401 | + | |
411 | 402 | | |
412 | 403 | | |
413 | 404 | | |
| |||
548 | 539 | | |
549 | 540 | | |
550 | 541 | | |
551 | | - | |
552 | | - | |
553 | | - | |
554 | | - | |
555 | | - | |
556 | | - | |
557 | | - | |
558 | | - | |
559 | | - | |
560 | | - | |
561 | | - | |
562 | | - | |
| 542 | + | |
| 543 | + | |
| 544 | + | |
| 545 | + | |
| 546 | + | |
| 547 | + | |
| 548 | + | |
| 549 | + | |
| 550 | + | |
| 551 | + | |
| 552 | + | |
| 553 | + | |
| 554 | + | |
| 555 | + | |
| 556 | + | |
563 | 557 | | |
564 | 558 | | |
565 | 559 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
| 30 | + | |
30 | 31 | | |
31 | 32 | | |
32 | 33 | | |
| |||
323 | 324 | | |
324 | 325 | | |
325 | 326 | | |
326 | | - | |
327 | | - | |
328 | | - | |
329 | | - | |
330 | | - | |
| 327 | + | |
| 328 | + | |
| 329 | + | |
| 330 | + | |
| 331 | + | |
| 332 | + | |
331 | 333 | | |
332 | 334 | | |
333 | 335 | | |
| |||
631 | 633 | | |
632 | 634 | | |
633 | 635 | | |
634 | | - | |
635 | | - | |
| 636 | + | |
| 637 | + | |
| 638 | + | |
636 | 639 | | |
637 | 640 | | |
638 | 641 | | |
| |||
781 | 784 | | |
782 | 785 | | |
783 | 786 | | |
784 | | - | |
785 | | - | |
786 | | - | |
787 | | - | |
788 | 787 | | |
789 | 788 | | |
790 | 789 | | |
| |||
0 commit comments