I am trying to find the time complexity of a binary decision tree algorithm. I have understood that at each node, the complexity is bounded by the complexity of searching the best attribute O(m nlog n) knowing that m is the number of features and n is the number of exemples in the training set. I think we should multiply O(m nlog n) by the number of nodes to find the complexity of the whole algorithm, is it? I don't understand why in some resources, the complexity of the decision tree is considered only O(m nlog n)!
Can anyone explain this? Is there any difference in the computation of the complexity, depending on whether categorical attributes or continuous are used?
 
     
     
    