XGBoost-Deep Residual (FedXGB-ResNet) collaborative porosity prediction in a federated learning framework
Tianwen Zhao , Guoqing Chen , Junyan Li , Cong Pang , Palakorn Seenoi , Nipada Papukdee , Piyapatr Busababodhin
Journal of Seismic Exploration ›› 2026, Vol. 35 ›› Issue (2) : 201 -224.
Privacy protection and multimodal fusion represent significant challenges in the context of cross-institutional geological data collaboration. To address these challenges, this paper proposes a collaborative XGBoost-Deep Residual model (FedXGB-ResNet) within a federated learning framework to achieve high-accuracy porosity prediction. Through a heterogeneous federated integration architecture, the framework combines XGBoost’s efficient modeling of structured well-logging parameters (with a feature importance gain of 41.2%) with ResNet’s deep extraction of spatiotemporal features from seismic images (87% overlap in activation maps between training and validation sets). The model achieved R2 scores of 0.87 and 0.83 on North Sea and Bakken oilfield datasets, respectively, representing improvements of 12.7% and 9.3% over the traditional FedAvg-XGBoost baseline. The innovative dual privacy protection mechanisms—gradient obfuscation and Paillier encryption—suppressed the membership inference attack success rate to 13.7% and reduced gradient similarity to 0.19 at a noise scale of 1.5 (privacy budget = 0.75), while increasing communication time by only 23%. A dynamic feature distillation mechanism adaptively fuses multimodal features through gated attention units, narrowing the F1-score gap between high-porosity and low-porosity identification to 5%. Experiments demonstrated that the framework reduces the risk of privacy leakage by 84.7% while retaining 93.4% of the performance of a centralized model, offering a balanced solution for collaborative cross-domain geological data analysis in terms of accuracy, privacy, and efficiency.
Federated learning / Porosity prediction / XGBoost / Deep residual network / Multimodal fusion / Geological data analysis
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
|
| [37] |
|
| [38] |
|
| [39] |
|
/
| 〈 |
|
〉 |