In this paper, we introduce a learning-based approach to optimize a joint constellation for a multi-user MIMO broadcast channel (T Tx antennas, K users, each with R Rx antennas), with perfect channel knowledge. The aim of the optimizer (MAX-MIN) is to maximize the minimum mutual information between the transmitter and each receiver, under a sum-power constraint. The proposed optimization method do neither impose the transmitter to use superposition coding (SC) or any other linear precoding, nor to use successive interference cancellation (SIC) at the receiver. Instead, the approach designs a joint constellation, optimized such that its projection into the subspace of each receiver k, maximizes the minimum mutual information I(Wk; Yk) between each transmitted binary input Wk and the output signal at the intended receiver Yk. The rates obtained by our method are compared to those achieved with linear precoders.