Skip to main content

Questions tagged [algorithm-request]

Use this tag when you're looking for an algorithm (in the context of artificial intelligence) to solve your specific problem.

3votes
1answer
91views

Is DPG a policy-based method or an actor-critic method?

I have a question about whether the Deterministic Policy Gradient algorithm in it's basic form is policy-based or actor-critic. I have been searching for the answer for a while and in some cases it ...
marc_spector's user avatar
2votes
1answer
72views

Is there an algorithm capable of telling knots and links apart?

I'd like to create some algorithm that can tell apart knots from links. A knot is made up of one line, a link with multiple. As input, we expect images as above. Is there an algorithm capable of ...
Csaba Daniel Farkaš's user avatar
2votes
2answers
855views

Is there an algorithm that produces a uniform distribution over the set of trajectories with maximum reward sum?

I am not an expert in reinforcement learning. I am applying it to my field of study. I am training a model such that given a state, it predicts the probability of taking an action for every action ...
moe asal's user avatar
0votes
1answer
59views

Machine Learning Algorithm for identifying the factors contributing to academic performance of students

I have a dataset with several qualitative and quantitative attributes, including age, location (longitude, latitude), city, parent occupation, family size, GPA etc. My task is to find the attributes/...
Dawood Ahmad's user avatar
0votes
0answers
30views

Implement tensor operation with mini batches instead of Matrix multiplication in forward pass

The forward pass in neural network can be written as g(Wx+b). W is the weights Matrix, x is the input vector and b is bias, and g the non linearity, the activation function. However x can have more ...
CoffeDeveloper's user avatar
0votes
0answers
57views

Symmetrical AI (same inputs as outputs) for Modelling and Prediction

I was wondering if there is an algorithm that helps create 'symmetrical' AIs (an AI that has the same inputs as it has outputs) that could be used for 'simulating' a system. The idea would be that ...
Steve Harding's user avatar
3votes
0answers
41views

Algorithms for average reward reinforcement learning in continuous/general state-action space

I see that discounted reward reinforcement learning has been extensively studied in the literature. However, the average reward metric receives less attention, and it looks like algorithms for this ...
k2pctdn's user avatar
3votes
1answer
640views

Match two paragraphs of text

I'm building a friend finder app and I need to match people based on a paragraph of text. Here is an example of what I mean: Person A: I love walking and going to the beach, I also love reading and ...
Dom's user avatar
  • 31
2votes
1answer
61views

What are the most effective methods and tools for summarizing long-form content like articles, editorials, and discussion threads for an app?

With users expecting instantaneous information and no compromise on in-depth details, app developers are challenged to condense long-form content such as articles, editorials, and discussion threads ...
Anshu Kumar Gupta's user avatar
2votes
1answer
73views

Is there any interpretation method suitable for CNNs which do regression tasks?

I mainly tackle regression problems by CNNs, and want to find a reliable method to calculate the heatmaps for NN's results. However, I find almost all interpretation methods including CAM is used for ...
minghuisvn's user avatar
1vote
1answer
43views

What models/algorithms besides variational autoencoders can I use to transform a discrete input into a differentiable latent space?

Let's say I have a discrete input and want to transform it into a differentiable latent space. What models/algorithms besides variational autoencoders can I use?
postnubilaphoebus's user avatar
1vote
0answers
97views

What strategies are there to reduce the variance of the policy gradient estimator of the REINFORCE algorithm?

What strategies are there to reduce the variance of the policy gradient estimator of the REINFORCE algorithm? I know one possibility is to subtract a baseline as a running average of rewards from past ...
postnubilaphoebus's user avatar
0votes
1answer
160views

Machine Learning Methods commonly used when data are scarse

It is well-known that deep neural networks require lots of data to perform reliably and well. A commonly-cited statistic is that you need at least 10,000 examples per class for a classification ...
postnubilaphoebus's user avatar
2votes
0answers
37views

What are some non-RL-based approaches to solving a typical bin assignment problem?

What are some non-RL-based approaches to solving a typical bin assignment problem, i.e., given a set of items (can be multidimensional), find the bin/knapsack/target which best packs (with minimum ...
helloworld's user avatar
3votes
0answers
152views

Are there Reinforcement Learning algorithms specialized for the case $\gamma=0$?

I have a Reinforcement Learning problem where the optimal policy does not depend on the next state (ie gamma equals 0). I think this means that I only need an efficient exploration algorithm coupled ...
AJSV's user avatar

153050per page
close