Questions tagged [language-model]

Ask Question

Language models are used extensively in Natural Language Processing (NLP) and are probability distributions over a sequence of words or terms.

154 questions

2votes

0answers

15views

Evaluation of token importance attribution based on human rationales

I am working on evaluating an explainability method for a text classification model that predicts whether a given text sequence contains hate speech or not. The method outputs token-level importance ...

Marc

asked Apr 8 at 17:18

0votes

0answers

59views

How much improvement does OpenAI o1 achieve from the chain of thought?

https://openai.com/index/learning-to-reason-with-llms/ OpenAI o1 also add more data than the last version of LLM.

CoderOnly

asked Sep 13, 2024 at 1:36

0votes

0answers

64views

For image+text, how is pre-training of Multimodal LLM generally done?

For image+text without video, how is pre-training of Multimodal Large Language Model generally done? Choice-1: Transform image to text, and then input all the text to LLM? Choice-2: Transform image to ...

CoderOnly

asked Aug 20, 2024 at 1:49

0votes

0answers

43views

Generating transaction data for a dataset to train on

My project is to predict what payment option a customer might use depending on various factors on a checkout screen. For example here are some of the fields I would have Variables : User_Location ...

Naeem Mujeeb

asked Jun 27, 2024 at 22:30

0votes

0answers

9views

What are the key quality metrics for large language model releases?

I am a first year PhD student working on improving the release practices of Machine Learning Models, especially pre-trained large language models. I want to understand the above concept for a ...

Eyinlojuoluwa

asked Jun 8, 2024 at 21:15

0votes

0answers

14views

What is query generation re-ranking method?

I am reading up on reranking methodologies that leverage LLMs. Relevant literature. One of the methods suggested is query generation Or, the same methodology from another source The task is to rank ...

figs_and_nuts

asked Jun 8, 2024 at 4:39

0votes

0answers

28views

How to find out that a conversation with a chatbot is likely ended

I'm working on a ChatBot with Python and langchain, and I'd like to have a metric that I could use to understand how close we ...

user163273

asked May 31, 2024 at 9:50

1vote

1answer

65views

Callback handlers in Langchain

This might be an odd question, but why is there two codes for the class BaseCallbackHandler? https://api.python.langchain.com/en/latest/_modules/langchain_core/callbacks/base.html#BaseCallbackHandler ...

Justin Jonany

asked May 16, 2024 at 20:42

0votes

1answer

52views

What languages llama2 supports?

Which languages llama2 supports? I looked at the docs and huggingface but I couldn't find a list. Just it says usage in other languages than English as out-of-scope.

heyula

asked Feb 29, 2024 at 14:34

0votes

1answer

50views

How can I get the list of pretrained large language models?

Is there any place I can get the list of pre-trained large language models in a neat way? Despite the most common ones like gpt, BARD, llama2, which llm do you suggest that can be used for RAG and ...

heyula

asked Feb 29, 2024 at 9:35

0votes

1answer

77views

How to check the license of a LLM for specific use?

How to check if a large language model has a license allowing to fine tune the model and then publish it publicly? How can I be sure that I can use and fine-tune a large language model without ...

heyula

asked Feb 28, 2024 at 18:15

0votes

2answers

67views

How to choose ideal pretrained model for fine-tuning?

I started to work with LLMs lately and want to know how people choose their pre-trained models in their fine-tuning tasks? What is the criteria to choose the base model and which factors affect?

heyula

asked Feb 22, 2024 at 15:07

0votes

1answer

43views

Is Machine Reading Comprehension (MRC) outdated?

I recently went through some litterature about knowledge-enhanced language models and found connections with the Machine Reading Comprehension (MRC) task. However, I couldn't find papers more recent ...

Barbara Gendron

asked Dec 18, 2023 at 14:04

1vote

1answer

613views

How can I leverage machine learning for log analysis?

I am new to data science and trying to find possibilities of using datascience in tasks. I have a set of logs which I want to convert to json. The logs are more or less of same format and I can write ...

SUNITA GUPTA

asked Dec 10, 2023 at 8:33

0votes

1answer

186views

Purely extractive Language Model

Given an email thread, I am trying to extract the body of the most recent email. I used to do that with rules. Now I am testing Large Language Models (LLM) to see if I they provide a less ad hoc ...

mirix

asked Nov 24, 2023 at 11:00

15 30 50per page

2 3 4 5

…

11 Next

Stack Exchange Network

Questions tagged [language-model]

Evaluation of token importance attribution based on human rationales

How much improvement does OpenAI o1 achieve from the chain of thought?

For image+text, how is pre-training of Multimodal LLM generally done?

Generating transaction data for a dataset to train on

What are the key quality metrics for large language model releases?

What is query generation re-ranking method?

How to find out that a conversation with a chatbot is likely ended

Callback handlers in Langchain

What languages llama2 supports?

How can I get the list of pretrained large language models?

How to check the license of a LLM for specific use?

How to choose ideal pretrained model for fine-tuning?

Is Machine Reading Comprehension (MRC) outdated?

How can I leverage machine learning for log analysis?

Purely extractive Language Model

Hot Network Questions

Questions tagged [language-model]

Related Tags