
All Questions

0 votes
0 answers
19 views

CLIPSeg: no change in performance metrics with a better convolutional decoder

I am training CLIPSeg on the Oxford-IIIT Pet dataset for semantic segmentation (3 classes: background, cat, dog). In short, I attach a decoder to the CLIP encoder. The encoder outputs: ...
0 votes
0 answers
32 views

Why is training for the game 2048 not working well for me?

So I started training neural networks for the game 2048. Here is my code for training with a DQN algorithm. The Gymnasium env code should be OK, so I'm not including it here, but if you need it, here is the ...
0 votes
2 answers
188 views

How can I reduce the loss? Why do I have the high loss and why do I have the gradient?

I want to classify about 200,000 images with a CNN, but I get a very high loss; see the figures "Loss over the whole training run" and "Loss for each epoch". It confuses me that there ...
0 votes
1 answer
164 views

How to improve the performance when no shuffling of dataloader is needed?

I'm currently doing some research on video recognition. What I'm trying to do is similar to this paper. The idea is that, to process a specific input video clip (shape: [T, C, H, W]), it needs ...
1 vote
2 answers
106 views

How to Represent Boardless Board Game as Input to RL Model?

For my thesis project, I am creating an Imitation Learning (IL) agent that learns to play the board game Hive, which lacks a traditional 2D board. Pieces are placed relative to one ...
2 votes
1 answer
124 views

Custom Loss Function Traps Network in Local Optima

I am working with a feedforward neural network to fit the following simple function: N(1) = -1, N(2) = -1, N(3) = 1, N(4) = -1. But I don't want to use the Mean-...
0 votes
0 answers
60 views

How to solve the issue with getting free ports in PyTorch DDP?

I am facing issues with getting a free port in the DDP setup block of PyTorch for parallelizing my deep learning training job across multiple GPUs on a Linux HPC cluster. I am trying to submit a deep ...
0 votes
0 answers
68 views

Convert SciBERT to GGUF

I want to use the SciBERT weights in Ollama. Ollama accepts the GGUF format, whereas SciBERT is in another format. I downloaded SciBERT from this Hugging Face link. I tried to convert it with llama.cpp ...
2 votes
0 answers
109 views

Solving the 2D Helmholtz Equation over a sphere with a PINN

Background: I am working on implementing solutions to various partial differential equations through Physics-informed neural networks (PINNs). I have started working on the Helmholtz equation and was ...
0 votes
0 answers
38 views

PPO with multiple actions per action vector

I would like a vector such as [0.2, 0.6, 0.3, 0.4, 0.8] to end up looking like [0, 1, 0, 0, 1] after training. In other words, rather than choosing one action, I'm choosing more ...
0 votes
0 answers
59 views

PyTorch and Opacus for Differential Privacy

When testing an example code from the TensorFlow website using Jupyter Notebook, which is available at the following link: [LINK_1], I encountered an error. You can find my question about that error ...
0 votes
0 answers
22 views

Spikes in Loss During Training both train/val datasets with LSTM

I think I'm seeing good results, but I want to understand why these spikes in loss are occurring. As you can see, my training appears to be working as it should, but every 200 or so epochs ...
0 votes
0 answers
61 views

SSIM in validation higher than SSIM in training for image denoising

I'm working to denoise microscopy images using a 2D U-Net. I'm training my network on images taken at different z-levels, and these images have ground truth, which is the mean of the images in z. ...
0 votes
1 answer
148 views

How to Create a 1D Embedding from Tensors of Varying Sizes?

I am new to AI and am experimenting with some computer vision algorithms. I have three tensors with different sizes. Noise augmentation levels tensor with size (N, C, H, W), diffusion timestep tensor of ...
2 votes
1 answer
1k views

How do I start implementing a deep learning paper from scratch?

I am fairly new to deep learning. I want to implement a deep learning paper from scratch with proper data preprocessing, model, losses, etc., using an object-oriented approach in Python. I want to do ...
