Kolmogorov Arnold Networks, Parameter Reduction, B-splines, radial basis functions, layer normalization, attention mechanisms