1) Machine learning algorithms aim to learn patterns in labeled data to predict labels for new data, while data mining describes patterns without guaranteed generalization.
2) Running machine learning on Hadoop has issues with iterations and data sparsity causing many small, empty files.
3) Techniques like speculation, grouping rare values, and sampling can improve performance by reducing iterations and sparsity when learning decision trees on Hadoop.