Why is there no LlavaForSequenceClassification? #31814

Zuhashaik · 2024-07-06T10:03:19Z

Feature request

A LlavaForSequenceClassification class is needed in 'src/transformers/models/llava/modeling_llava.py'.

Motivation

If there were a LlavaForSequenceClassification class, we could use decoder-only models for classification tasks.

Your contribution

Allow me to contribute to this as I am already working on this model for mental health meme classification for the upcoming AAAI conference.

amyeroberts · 2024-07-06T22:56:07Z

Hi @Zuhashaik, thanks for opening this issue!

We only add specific task heads for models if official checkpoints are available, the task is described in the original paper, or there is large community demand.

NielsRogge · 2024-07-08T09:33:52Z

Note that you could technically also use LlavaForConditionalGeneration by training the model to only spit out one classification token per image.
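One way to read this suggestion at inference time is to compare only the next-token logits of a few reserved label tokens. The snippet below is a minimal sketch in plain PyTorch, not the library's API: the function name `classify_from_next_token_logits`, the choice of label token ids, and the assumption of right-padded inputs whose `attention_mask` has the same sequence length as `logits` are all illustrative.

```python
import torch


def classify_from_next_token_logits(logits, attention_mask, label_token_ids):
    """Pick a class by comparing the logits of a few label tokens.

    logits:          (batch, seq_len, vocab) output of a causal LM
    attention_mask:  (batch, seq_len), 1 for real tokens, right-padded (assumption)
    label_token_ids: one vocabulary id per class (hypothetical choice)
    """
    device = logits.device
    # Index of the last real (non-padding) position in each sequence.
    last_idx = attention_mask.to(device).long().sum(dim=1) - 1
    batch_idx = torch.arange(logits.size(0), device=device)
    last_logits = logits[batch_idx, last_idx]          # (batch, vocab)
    # Keep only the columns that correspond to class tokens.
    label_ids = torch.as_tensor(label_token_ids, device=device)
    label_logits = last_logits[:, label_ids]           # (batch, num_labels)
    return label_logits.argmax(dim=-1), label_logits
```

The returned `label_logits` could also be fed to a cross-entropy loss during fine-tuning, so the model learns to put its probability mass on one label token per image.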

Zuhashaik · 2024-07-08T15:21:28Z

> Note that you could technically also use LlavaForConditionalGeneration by training the model to only spit out one classification token per image.

Yeah, I attached a score layer on top of the model and passed it the last-layer hidden states. While doing this, some CUDA-related issues ("data in different devices cuda:0 and cuda:1") came up.

So I changed the LlavaForConditionalGeneration class itself in the modeling_llava.py file (added a score layer) and used that for classification.

Thank you.
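For readers who hit the same device mismatch with an external head, the following is a minimal sketch of a score layer that moves its inputs to its own device before pooling. It is not the code used in this thread: `ScoreHead` is a hypothetical name, and it assumes right-padded inputs and an `attention_mask` with the same sequence length as the hidden states.

```python
import torch
from torch import nn


class ScoreHead(nn.Module):
    """Hypothetical linear head over the last non-padding token's hidden state."""

    def __init__(self, hidden_size, num_labels):
        super().__init__()
        self.score = nn.Linear(hidden_size, num_labels, bias=False)

    def forward(self, hidden_states, attention_mask):
        # With a model sharded across GPUs, the last layer's output may live on
        # a different device than this head; move the inputs to the head.
        device = self.score.weight.device
        hidden_states = hidden_states.to(device)
        attention_mask = attention_mask.to(device)
        # Pool the hidden state at the last real token (right padding assumed).
        last_idx = attention_mask.long().sum(dim=1) - 1
        batch_idx = torch.arange(hidden_states.size(0), device=device)
        pooled = hidden_states[batch_idx, last_idx]    # (batch, hidden_size)
        return self.score(pooled)                      # (batch, num_labels)
```

The hidden states would typically come from a forward pass with `output_hidden_states=True`; placing the head on the same device as the final decoder layer avoids the extra copy.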

Zuhashaik · 2024-07-12T09:42:47Z

I am facing lots of data type and quantization errors while defining the classifier (score) separately. Please add LlavaForSequenceClassification and SequenceClassification classes for vision models.

> Hi @Zuhashaik, thanks for opening this issue!
>
> We only add specific task heads for models if official checkpoints are available, the task is described in the original paper, or there is large community demand.

Thank you!
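A common source of such data type errors is a half-precision (or quantized) backbone feeding a separately defined float32 head. The sketch below shows one possible workaround under that assumption: cast the pooled activations to the head's parameter dtype before the matrix multiply. The helper name `score_in_head_dtype` and the tensor sizes are illustrative only.

```python
import torch
from torch import nn


def score_in_head_dtype(head: nn.Linear, pooled: torch.Tensor) -> torch.Tensor:
    """Run a classification head on activations cast to the head's own dtype."""
    # Calling head(pooled) directly with mismatched dtypes typically fails,
    # so align the activations with the head's weights first.
    return head(pooled.to(head.weight.dtype))


# Stand-ins: a float32 head and bfloat16 activations from a backbone.
head = nn.Linear(16, 3)
pooled = torch.randn(2, 16).to(torch.bfloat16)
logits = score_in_head_dtype(head, pooled)
```

Keeping the small head in float32 and outside any quantization config is one design choice; casting the head to the backbone's dtype instead would also remove the mismatch.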

Zuhashaik added the Feature request label on Jul 6, 2024
