### Is the temperature equal to epsilon in Reinforcement Learning?

This is a piece of code from my homework. # action policy: implements epsilon greedy and softmax def select_action(self, state, epsilon): qval = self.qtable[state] prob = [] if (self.softmax): # use Softmax distribution prob = sp.softmax(qval / epsilon) #print(prob) else: # assign equal value to all actions prob = np.ones(self.actions) * epsilon / (self.actions -1)…

if I define the architecture of a neural network using only dense fully connected layers and train them such that there are two models which are trained using model.fit() and GradientTape. Both the methods of training use the same model architecture. The randomly initialized weights are shared between the two models and all other parameters…

### How to process data in a data stream for a LSTM

How can a data stream for a RNN (LSTM) be handled, when the stream contains data sets belonging to different prediction classes? Training phase: I have trained a LSTM to predict a class out of a sequence of Letters. For the training phase I used a fixed data array where the beginning an the ending…

### How can one stack CNNs?

I know that ensembles can be made by combining sklearn models with a VotingClassifier, but is it possible to combine different deep learning models? Will I have to make something similar to Voting Classifiers?

### Can I solve this assignment problem with RL or AI planning, and if yes how?

I have a list of positive nonzero integers $L=[v_1,\dots,v_𝑛|v_𝑖\in Z^{\neq}]$ which sum up to $V=\sum_i v_i$. The list is not sorted, i.e., there’s no guarantee that $v_i\leq v_{i+1}\ \forall\ i.$ Each integer can be be assigned either to a set $S_1$ or a set $S_2$: equivalently, it can be labeled as $l_1$ or $l_2$. The…

### Pose Estimation Feasibility on IoT

I want to estimate hand poses and recognize gestures using an open source library like OpenPose on live video. Considering the fact that such libraries are very computationally intensive; how likely is it that it will run on a Raspberry Pi 4 using a Pi-Cam while giving me something above 15 fps? Assume that the…

### What is the difference between the concepts “known environment” and “deterministic environment”?

According to the book “Artificial Intelligence: A Modern Approach”, “In a known environment, the outcomes (or outcome probabilities if the environment is stochastic) for all actions are given.”, and in a deterministic environment, “the next state of the environment is completely determined by the current state and the action executed by the agent…”. What’s the…

### How to train FFNN with Q-learning?

I know that in any NN architecture, the input data are states, and at the output layer Q-functionality of each action. Tell me please, how to adjust all weights in this case?

### Is it possible to train a neural network with 3 inputs and 12 outputs?

The selection of experimental data includes a set of vectors of different dimensions. The input is a 3-dimensional vector, and the output is a 12-dimensional vector. The sample size is 120 pairs of input 3-dimensional and output 12-dimensional vectors. Is it possible to train such a neural network (in MATLAB)? Which structure of the neural…

### Efficient implementation of seperable convolution in tensorflow

It seems like the native implementation of separable convolution in tensorflow is not efficient. https://github.com/tensorflow/tensorflow/issues/12940 Is anyone aware how can we get an efficient implementation of separable convolution in tensorflow from somewhere? If not is there any working/efficient implementation of separable convolution in other libraries?