### What kind of enemy to train a good RL-agents

So I want to create an RL-agent for two players-board game. I want to use a simple DQN for the first player (my RL-agent). Then, what kind of algorithm that should I use on the second player (my RL-agent’s enemy)? I have three options in my mind: a random agent that act randomly a rule-based…

### How does the BERT model (in Tensorflow or Paddle-paddle frameworks) relate to nodes of the underlying neural-net that’s being trained?

The BERT model in frameworks like TensorFlow/Paddle-paddle shows various kinds of computation nodes (like subtract, accumulate, add, mult etc) in a graph like form in 12 layers. But this graph doesn’t look anything like a neural-network, one that’s typically shown in textbooks (e.g. like this https://en.wikipedia.org/wiki/Artificial_neural_network#/media/File:Colored_neural_network.svg) where each edge has a weight that’s being trained…

### Is there a good ratio between the positive and negative rewards in reinforcement learning?

Is there an ideal ratio in reinforcement learning between the positive and negative rewards? Suppose I have the scenario of moving a robot across the river. There are two options, walk across the bridge or walk across the river. If it walks across the river then the robot breaks so the idea is to reinforce…

### Combining Cross Entropy With Mean Squared Error

I am making an MNIST classifier. I am using categorical cross-entropy as my loss function. I want to make it so that if the correct label is 3, then it will penalize the model more heavily if it classifies a 4 than a 7 because 4 is closer numerically to 3 than 7 is. How…

### What does the arrow do in this equation?

I get the idea of what the equation does on the right of the arrow, I just wondered what the significance of the arrow is? I presume it’s not the same as ‘equals’?

### What does the arrow do in this equation?

I get the idea of what the equation does on the right of the arrow, I just wondered what the significance of the arrow is? I presume it’s not the same as ‘equals’?

### What does the arrow do in this equation?

I get the idea of what the equation does on the right of the arrow, I just wondered what the significance of the arrow is? I presume it’s not the same as ‘equals’?

### Is there an AI that can complete Deezer Spleeter work?

I have used Deezer Spleeter but it produces echoes aside the stems, so I wonder if there is already an AI that remove echoes noises.

### Indexing tensors in custom loss function with Keras

I’m using a custom loss function in Keras. This is the function: def custom_loss(groups_id_count): def listnet_loss(real_labels, predicted_labels): losses = tf.placeholder(shape=[None], dtype=tf.float32) # Tensor of rank 1 for group in groups_id_count: start_range = 0 end_range = (start_range + group[1]) batch_real_labels = tf.slice(real_labels, [start_range, 1, None], [end_range, 1, None]) batch_predicted_labels = tf.slice(predicted_labels, [start_range, 0, 0], [end_range, 0,…

### Whats Wrong With this Code?

“”” Visualize Genetic Algorithm to find a minimul point in a function. “”” import numpy as np import matplotlib.pyplot as plt DNA_SIZE = 50 # DNA length POP_SIZE = 1000 # population size CROSS_RATE = 0.8 # mating probability (DNA crossover) MUTATION_RATE = 0.003 # mutation probability N_GENERATIONS = 200 Mu_BOUND = [25, 40] #…