Questions tagged [algorithm-request]
Use this tag when you're looking for an algorithm (in the context of artificial intelligence) to solve your specific problem.
113 questions
3votes
1answer
91views
Is DPG a policy-based method or an actor-critic method?
I have a question about whether the Deterministic Policy Gradient algorithm in it's basic form is policy-based or actor-critic. I have been searching for the answer for a while and in some cases it ...
2votes
1answer
72views
Is there an algorithm capable of telling knots and links apart?
I'd like to create some algorithm that can tell apart knots from links. A knot is made up of one line, a link with multiple. As input, we expect images as above. Is there an algorithm capable of ...
2votes
2answers
855views
Is there an algorithm that produces a uniform distribution over the set of trajectories with maximum reward sum?
I am not an expert in reinforcement learning. I am applying it to my field of study. I am training a model such that given a state, it predicts the probability of taking an action for every action ...
0votes
1answer
59views
Machine Learning Algorithm for identifying the factors contributing to academic performance of students
I have a dataset with several qualitative and quantitative attributes, including age, location (longitude, latitude), city, parent occupation, family size, GPA etc. My task is to find the attributes/...
0votes
0answers
30views
Implement tensor operation with mini batches instead of Matrix multiplication in forward pass
The forward pass in neural network can be written as g(Wx+b). W is the weights Matrix, x is the input vector and b is bias, and g the non linearity, the activation function. However x can have more ...
0votes
0answers
57views
Symmetrical AI (same inputs as outputs) for Modelling and Prediction
I was wondering if there is an algorithm that helps create 'symmetrical' AIs (an AI that has the same inputs as it has outputs) that could be used for 'simulating' a system. The idea would be that ...
3votes
0answers
41views
Algorithms for average reward reinforcement learning in continuous/general state-action space
I see that discounted reward reinforcement learning has been extensively studied in the literature. However, the average reward metric receives less attention, and it looks like algorithms for this ...
3votes
1answer
640views
Match two paragraphs of text
I'm building a friend finder app and I need to match people based on a paragraph of text. Here is an example of what I mean: Person A: I love walking and going to the beach, I also love reading and ...
2votes
1answer
61views
What are the most effective methods and tools for summarizing long-form content like articles, editorials, and discussion threads for an app?
With users expecting instantaneous information and no compromise on in-depth details, app developers are challenged to condense long-form content such as articles, editorials, and discussion threads ...
2votes
1answer
73views
Is there any interpretation method suitable for CNNs which do regression tasks?
I mainly tackle regression problems by CNNs, and want to find a reliable method to calculate the heatmaps for NN's results. However, I find almost all interpretation methods including CAM is used for ...
1vote
1answer
43views
What models/algorithms besides variational autoencoders can I use to transform a discrete input into a differentiable latent space?
Let's say I have a discrete input and want to transform it into a differentiable latent space. What models/algorithms besides variational autoencoders can I use?
1vote
0answers
97views
What strategies are there to reduce the variance of the policy gradient estimator of the REINFORCE algorithm?
What strategies are there to reduce the variance of the policy gradient estimator of the REINFORCE algorithm? I know one possibility is to subtract a baseline as a running average of rewards from past ...
0votes
1answer
160views
Machine Learning Methods commonly used when data are scarse
It is well-known that deep neural networks require lots of data to perform reliably and well. A commonly-cited statistic is that you need at least 10,000 examples per class for a classification ...
2votes
0answers
37views
What are some non-RL-based approaches to solving a typical bin assignment problem?
What are some non-RL-based approaches to solving a typical bin assignment problem, i.e., given a set of items (can be multidimensional), find the bin/knapsack/target which best packs (with minimum ...
3votes
0answers
152views
Are there Reinforcement Learning algorithms specialized for the case $\gamma=0$?
I have a Reinforcement Learning problem where the optimal policy does not depend on the next state (ie gamma equals 0). I think this means that I only need an efficient exploration algorithm coupled ...