
This experiment aims to investigate the impact of data distribution heterogeneity on the performance of federated learning algorithms. The experiment uses a fixed number of 3 clients, each representing a survey point. Seventeen heterogeneity levels (0-16) are designed to simulate data distribution scenarios ranging from completely homogeneous to extremely heterogeneous. The data allocation strategy assigns a target client to each species category. For example, 'zhaoshui' is assigned to client 2, 'xiaotuan' to client 0, and 'nizi' to client 1. Heterogeneity level 0 represents a completely even distribution, with each species category accounting for approximately 33% in each of the three clients. Heterogeneity levels 1-6 indicate that the proportion of the target category in the target client gradually increases from 43% to 90%, with the remaining samples evenly distributed to the other two clients. Heterogeneity levels 7-15 indicate that the proportion of the target category in the target client gradually increases from 91% to 99%, where only 8-10 samples of the other two species remain in each client besides the main species. Heterogeneity level 16 represents complete heterogeneity, with the target category accounting for 100% in the target client and 0% in the other two clients. The 'noise' category remains evenly distributed across all heterogeneity levels. The evaluation results for each heterogeneity level on an independent test set are shown in the table.
A schematic diagram in 16:9 format with a white background, ...