When taking derivatives involving vectors (and matrices), do we treat "x^T" as "x", and if so, why can we do this? Based on this page, it seems as though we simply add the transpose to make sure the dimensions are correct for matrix multiplication, but there should be a formulaic approach so that we can implement this in code, right?
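One way to make the question concrete is to write the function out in components, where the transpose disappears entirely: "x^T" and "x" hold the same numbers, and the transpose only records whether they are laid out as a row or a column. The sketch below is an illustration under assumptions, not a definitive recipe: the helper `numerical_gradient`, the test functions a^T x and x^T A x, and the random 3-dimensional data are all chosen here for demonstration. It compares finite-difference derivatives against the component-wise formulas.

```python
import numpy as np

def numerical_gradient(f, x, eps=1e-6):
    """Central-difference estimate of df/dx_i for each component of x.

    Hypothetical helper for illustration; not from any particular library.
    """
    grad = np.zeros_like(x, dtype=float)
    for i in range(x.size):
        step = np.zeros_like(x, dtype=float)
        step[i] = eps
        grad[i] = (f(x + step) - f(x - step)) / (2 * eps)
    return grad

# Arbitrary example data (an assumption made for this sketch).
rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3))
a = rng.standard_normal(3)
x = rng.standard_normal(3)

# f(x) = a^T x = sum_i a_i x_i, so df/dx_i = a_i.
grad_linear = numerical_gradient(lambda v: a @ v, x)

# g(x) = x^T A x = sum_{i,j} x_i A_ij x_j,
# so dg/dx_k = sum_j A_kj x_j + sum_i x_i A_ik, i.e. (A + A^T) x.
grad_quadratic = numerical_gradient(lambda v: v @ A @ v, x)

linear_matches = np.allclose(grad_linear, a, atol=1e-5)
quadratic_matches = np.allclose(grad_quadratic, (A + A.T) @ x, atol=1e-5)
print(linear_matches, quadratic_matches)
```

Note that NumPy's 1-D arrays have no row or column orientation, so the code never writes a transpose on `x` at all. The derivative is computed per component, and the transposes in the textbook formulas only fix how those components are arranged so the matrix products are well defined.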