Newest 'algorithm-request' Questions

3votes

1answer

91views

Is DPG a policy-based method or an actor-critic method?

I have a question about whether the Deterministic Policy Gradient algorithm in it's basic form is policy-based or actor-critic. I have been searching for the answer for a while and in some cases it ...

marc_spector

57

asked Nov 12, 2024 at 11:08

2votes

1answer

72views

Is there an algorithm capable of telling knots and links apart?

I'd like to create some algorithm that can tell apart knots from links. A knot is made up of one line, a link with multiple. As input, we expect images as above. Is there an algorithm capable of ...

Csaba Daniel Farkaš

123

asked May 3, 2024 at 15:36

2votes

2answers

855views

Is there an algorithm that produces a uniform distribution over the set of trajectories with maximum reward sum?

I am not an expert in reinforcement learning. I am applying it to my field of study. I am training a model such that given a state, it predicts the probability of taking an action for every action ...

moe asal

143

asked Apr 9, 2024 at 18:46

0votes

1answer

59views

Machine Learning Algorithm for identifying the factors contributing to academic performance of students

I have a dataset with several qualitative and quantitative attributes, including age, location (longitude, latitude), city, parent occupation, family size, GPA etc. My task is to find the attributes/...

Dawood Ahmad

43

asked Apr 2, 2024 at 13:13

0votes

0answers

30views

Implement tensor operation with mini batches instead of Matrix multiplication in forward pass

The forward pass in neural network can be written as g(Wx+b). W is the weights Matrix, x is the input vector and b is bias, and g the non linearity, the activation function. However x can have more ...

CoffeDeveloper

291

asked Mar 30, 2024 at 7:57

0votes

0answers

57views

Symmetrical AI (same inputs as outputs) for Modelling and Prediction

I was wondering if there is an algorithm that helps create 'symmetrical' AIs (an AI that has the same inputs as it has outputs) that could be used for 'simulating' a system. The idea would be that ...

Steve Harding

1

asked Feb 18, 2024 at 11:01

3votes

0answers

41views

Algorithms for average reward reinforcement learning in continuous/general state-action space

I see that discounted reward reinforcement learning has been extensively studied in the literature. However, the average reward metric receives less attention, and it looks like algorithms for this ...

k2pctdn

155

asked Oct 10, 2023 at 19:47

3votes

1answer

640views

Match two paragraphs of text

I'm building a friend finder app and I need to match people based on a paragraph of text. Here is an example of what I mean: Person A: I love walking and going to the beach, I also love reading and ...

Dom

31

asked Jul 20, 2023 at 10:34

2votes

1answer

61views

What are the most effective methods and tools for summarizing long-form content like articles, editorials, and discussion threads for an app?

With users expecting instantaneous information and no compromise on in-depth details, app developers are challenged to condense long-form content such as articles, editorials, and discussion threads ...

Anshu Kumar Gupta

29

asked Feb 18, 2023 at 12:20

2votes

1answer

73views

Is there any interpretation method suitable for CNNs which do regression tasks?

I mainly tackle regression problems by CNNs, and want to find a reliable method to calculate the heatmaps for NN's results. However, I find almost all interpretation methods including CAM is used for ...

minghuisvn

21

asked Dec 21, 2022 at 1:19

1vote

1answer

43views

What models/algorithms besides variational autoencoders can I use to transform a discrete input into a differentiable latent space?

Let's say I have a discrete input and want to transform it into a differentiable latent space. What models/algorithms besides variational autoencoders can I use?

postnubilaphoebus

356

asked Dec 2, 2022 at 12:27

1vote

0answers

97views

What strategies are there to reduce the variance of the policy gradient estimator of the REINFORCE algorithm?

What strategies are there to reduce the variance of the policy gradient estimator of the REINFORCE algorithm? I know one possibility is to subtract a baseline as a running average of rewards from past ...

postnubilaphoebus

356

asked Nov 24, 2022 at 0:57

0votes

1answer

160views

Machine Learning Methods commonly used when data are scarse

It is well-known that deep neural networks require lots of data to perform reliably and well. A commonly-cited statistic is that you need at least 10,000 examples per class for a classification ...

postnubilaphoebus

356

asked Nov 15, 2022 at 15:37

2votes

0answers

37views

What are some non-RL-based approaches to solving a typical bin assignment problem?

What are some non-RL-based approaches to solving a typical bin assignment problem, i.e., given a set of items (can be multidimensional), find the bin/knapsack/target which best packs (with minimum ...

helloworld

65

asked Nov 11, 2022 at 3:07

3votes

0answers

152views

Are there Reinforcement Learning algorithms specialized for the case $\gamma=0$?

I have a Reinforcement Learning problem where the optimal policy does not depend on the next state (ie gamma equals 0). I think this means that I only need an efficient exploration algorithm coupled ...

AJSV

31

asked Oct 15, 2022 at 7:33

Stack Exchange Network

Questions tagged [algorithm-request]

Is DPG a policy-based method or an actor-critic method?

Is there an algorithm capable of telling knots and links apart?

Is there an algorithm that produces a uniform distribution over the set of trajectories with maximum reward sum?

Machine Learning Algorithm for identifying the factors contributing to academic performance of students

Implement tensor operation with mini batches instead of Matrix multiplication in forward pass

Symmetrical AI (same inputs as outputs) for Modelling and Prediction

Algorithms for average reward reinforcement learning in continuous/general state-action space

Match two paragraphs of text

What are the most effective methods and tools for summarizing long-form content like articles, editorials, and discussion threads for an app?

Is there any interpretation method suitable for CNNs which do regression tasks?

What models/algorithms besides variational autoencoders can I use to transform a discrete input into a differentiable latent space?

What strategies are there to reduce the variance of the policy gradient estimator of the REINFORCE algorithm?

Machine Learning Methods commonly used when data are scarse

What are some non-RL-based approaches to solving a typical bin assignment problem?

Are there Reinforcement Learning algorithms specialized for the case $\gamma=0$?

Hot Network Questions

Questions tagged [algorithm-request]

Related Tags