Digital Pathology and Artificial Intelligence: An Overview and Recent Updates

Zaibo Li

doi:10.2738/PR.2026.0005

Pathology Research ›› :1 -29. DOI: 10.2738/PR.2026.0005

Review

Digital Pathology and Artificial Intelligence: An Overview and Recent Updates

Zaibo Li

Author information +

History +

PDF (7624KB)

Abstract

Over the past decade, pathology has undergone a profound digital transformation. What began as the gradual adoption of whole slide imaging has rapidly evolved into a dynamic ecosystem that integrates high-resolution digital imaging, advanced computational analytics, artificial intelligence (AI), and increasingly, generative and multimodal models, and agentic AI. These developments are not merely technological upgrades; they represent a paradigm shift in how pathologists visualize disease, extract diagnostic and prognostic information, collaborate across distances, and contribute to precision medicine. This review aims to provide a comprehensive, practical, and forward-looking overview of this transformation and to bridge foundational concepts with real-world implementation of digital pathology workflow and emerging innovations such as multimodal foundation models and agentic AI in pathology.

Graphical abstract

Keywords

digital pathology / artificial intelligence (AI) / machine learning / deep learning / multimodal foundation model / agentic AI

Cite this article

Download citation ▾

Zaibo Li. Digital Pathology and Artificial Intelligence: An Overview and Recent Updates. Pathology Research 1-29 DOI:10.2738/PR.2026.0005

登录浏览全文

4963

注册一个新账户忘记密码

Introduction

Pathology sits at the center of modern medicine, providing definitive diagnoses that guide patient management, prognostication, and therapeutic decision-making. For more than a century, this discipline has relied on light microscopy and glass slides as its primary tools. While this paradigm has proven remarkably robust, it is increasingly challenged by rising case volumes, growing diagnostic complexity, subspecialization, workforce shortages, and escalating demands for accuracy, efficiency, and standardization. Against this backdrop, digital pathology (DP) and artificial intelligence (AI) have emerged as transformative technologies with the potential to fundamentally reshape pathology practice and its integration into clinical workflows.

DP involves the acquisition, management, sharing, and interpretation of pathology data in a digital environment. Whole-slide imaging (WSI) enables pathologists to review and analyze cases on computers, freeing them from traditional microscopes. While similar to radiology’s digital shift, DP faces unique challenges related to image size and complexity. Advances in scanning, storage, and software have driven its rapid adoption. Once limited to education and research, DP is now widely used in clinical practice, including primary diagnosis, with growing implementation across academic and community settings—accelerated by needs for remote work and resilience highlighted during the coronavirus disease 2019 (COVID-19) pandemic.

Concurrently, AI has advanced from experimental algorithms to clinically relevant tools that assist with detection, classification, quantification, and prognostic assessment. Deep learning (DL) models trained on whole-slide images can identify mitoses, grade tumors, quantify biomarkers, detect metastases, and reveal patterns linked to molecular and clinical outcomes. Rather than replacing pathologists, AI serves as an augmentative tool that improves consistency, reduces workload, and uncovers additional insights from pathology images.

The convergence of DP and AI represents more than a technological upgrade; it signals a paradigm shift in how pathology is practiced. Digital workflows enable seamless image sharing across institutions and disciplines, facilitating multidisciplinary team discussions, telepathology, and global consultation. AI-driven image analysis extends the diagnostic reach of pathology into quantitative and predictive domains, supporting personalized treatment strategies.

Despite this promise, the path to widespread adoption is complex. Implementing DP requires substantial investment, careful workflow redesign, robust validation, staff training, and adherence to regulatory and accreditation standards. Similarly, AI tools must be rigorously validated, transparently evaluated, and thoughtfully integrated into daily practice to ensure safety, usability, and clinical value. Ethical considerations including data privacy, algorithmic bias, accountability, and the evolving role of the pathologist must be addressed alongside technical innovation.

This review aims to provide a comprehensive, practical, and forward-looking overview of DP and AI as they relate to clinical workflow. It first explores the implementation of DP systems, and their clinical and non-clinical applications, and then delves into the fundamentals of AI and its clinical applications, and emerging developments such as generative AI (GAI), multimodal AI, and agentic AI.

Digital pathology

DP infrastructure

WSI systems

DP has evolved over nearly two centuries, from early photomicrography to telepathology and, more recently, WSI. A key milestone came in 2017, when the U.S. Food and Drug Administration approved the first WSI system for primary diagnosis, establishing DP as a clinically viable alternative to traditional microscopy. WSI scanners serve as the entry point to digital workflows, converting glass slides into high-resolution images by capturing and stitching sequential tiles. Modern WSI systems include two core components: image acquisition (slide scanning) and a workstation for image viewing and management. Scanner capabilities vary in capacity, throughput, and automation, with selection guided by case volume, specimen type, and turnaround needs.

Whole slide image scanners

Scanners can be broadly classified as high-throughput, medium-throughput, and single-slide systems[1,2].

High-throughput scanners digitize hundreds to thousands of slides per batch and are suited for large centers, with automated loading and barcode tracking. Medium-throughput systems support lower volumes with faster turnaround for targeted workflows such as frozen sections and consultations. Single-slide or on-demand scanners are used for rapid, flexible applications such as frozen sections, rapid on-site evaluation (ROSE) for cytology, and education, where immediate access is prioritized over batch efficiency.

Scanner speed depends on several factors, including magnification (20× vs. 40×), file format, focus algorithms, and use of z-stacking for thicker sections. Institutions adopting full digital sign-out should ensure scanner throughput aligns with daily slide volume, ideally enabling complete digitization within 24 h to avoid diagnostic delays.

Some DP systems have received U.S. FDA clearance/authorization for primary diagnosis (Table 1).

Whole slide imaging file formats

WSI files are large, often 1–3 GB per slide at 40×, making efficient compression essential for storage and network performance. Compression may be lossless or lossy; for primary diagnosis, lossless or minimally lossy formats are preferred, and settings should be validated to ensure diagnostic integrity. A longstanding challenge in DP is the variety of proprietary file formats (e.g., .svs, .ndpi, .mrxs, .iSyntax), each with vendor-specific structures, compression methods, and metadata.

To promote interoperability, the Digital Imaging and Communications in Medicine (DICOM) Working Group 26 extended the DICOM standard to encompass WSIs[3]. This standardization allows for consistent encoding of images, annotations, and metadata, supporting cross-platform compatibility and long-term accessibility[3]. Transitioning to DICOM requires infrastructure readiness and close vendor collaboration, but offers key advantages, including standardized data management, reduced vendor lock-in, and improved integration with radiology archives. As radiology and pathology converge within enterprise imaging strategies, DICOM compliance becomes increasingly important for creating unified patient imaging records[4,5].

Image quality and calibration

Diagnostic fidelity in DP depends on accurate color reproduction, image sharpness, and consistent focus. Regular scanner calibration using standardized color targets and focus slides is essential and should be built into routine maintenance, with quality control (QC) logs maintained for compliance[6–8]. Because pathologists are sensitive to color variation, especially in hematoxylin and eosin (H&E) staining, scanner software should support color normalization to reduce batch variability. Periodic quality assurance (QA) review of scanned slides is also important to detect artifacts such as stitching errors, debris, or incomplete scans.

Image management system (IMS)

Once slides are digitized, effective image management is critical to workflow success. The IMS serves as the central hub—analogous to the radiology Picture Archiving and Communication System (PACS)—enabling secure, efficient, and scalable storage and access to WSIs for clinical care, education, research, and QA.

A robust IMS must handle large files, support multiple formats, and integrate seamlessly with the laboratory information system (LIS), electronic medical record (EMR), and tools such as AI analytics and annotation platforms. This requires careful planning of database structure, network performance, caching, and access control. Key capabilities include rapid image access, smooth navigation, reliable linkage to LIS metadata, annotation tools, audit trails with role-based permissions, and multi-user collaboration. Several commercial IMS platforms are available, some with FDA clearance including Philips IntelliSite Pathology Solution, PathPresenter Platform, Sectra Digital Pathology Solution, PathAI AISight Dx, etc.

IMS performance and scalability

IMS performance is critical to user satisfaction, as pathologists expect rapid loading and smooth navigation without latency. This requires high-speed network connectivity and optimized streaming protocols. Modern systems use tile-based streaming, loading only the visible portion of an image to reduce bandwidth demands.

Scalability is equally important. As slide volumes grow, the IMS must efficiently manage millions of files. A modular architecture allows expansion through additional servers or cloud resources, and vendors should be assessed on their ability to support seamless horizontal scaling without service disruption.

Integration with LIS

The LIS remains the authoritative source of patient and specimen data in anatomic pathology. Seamless integration with the IMS ensures each digital slide is accurately linked to its case, block, and stain, typically via barcode identifiers. When a case is opened in the LIS, associated images should automatically load in the IMS, eliminating manual searching[9].

Bidirectional communication is essential: annotations and measurements from the IMS should be transferred to the LIS or reporting system, while case status updates synchronize across platforms. Effective integration reduces duplicate data entry and minimizes the risk of mismatched records and images.

Supporting AI integration and research

As AI advances, the IMS should be able to serve as a central platform for deploying inference tools, integrating automated image analysis into diagnostic workflows. Choosing an IMS that supports such extensions helps ensure future AI integration without major system redesign.

Beyond clinical use, WSI storage and management are critical for research, particularly in computational pathology and AI. Structured storage and metadata tagging enable efficient cohort selection and large-scale image retrieval for model development, while sandbox environments allow research access without compromising clinical systems or data integrity[10].

Data storage, security, and retention

Storage architecture is among the most technically demanding aspects of implementing DP. A fully digital laboratory can produce tens to hundreds of terabytes of image data each year, depending on case volume and scanning resolution. As a result, storage systems must carefully balance capacity, speed, redundancy, and cost[11,12].

Storage tiers and locations

Most institutions use a tiered storage model. Primary storage (Tier 1) consists of high-performance disks for active cases, prioritizing speed and reliability, often with redundant array of independent disks (RAID) for data protection. Secondary storage (Tier 2) offers slower but more cost-effective systems for recently completed cases. Archival storage (Tier 3) provides long-term, low-cost options—such as magnetic tape, optical media, or cloud-based cold storage—optimized for durability rather than retrieval speed. Institutional policies should specify how long images remain in each tier before migration, based on retention requirements and the likelihood of re-access.

Institutions must weigh the choice between on-premises and cloud-based storage. On-premises solutions offer greater control and lower latency but require substantial capital investment and ongoing maintenance, often using RAID arrays, storage area networks (SANs), or object-based systems with built-in data replication to protect against loss. Cloud storage provides scalable, flexible capacity and advanced redundancy with geographic replication, though institutions must ensure compliance with healthcare data protection regulations such as Health Insurance Portability and Accountability Act (HIPAA), General Data Protection Regulation (GDPR), or local equivalents. Hybrid models—combining high-speed on-premises storage for active cases with cloud archiving for long-term retention or disaster recovery—are increasingly common and cost-effective.

Data backup and security

Robust backup policies are essential to protect against data loss. Backups should be performed at regular intervals and stored in geographically separate locations, with automated verification processes to ensure integrity. Disaster recovery plans should define recovery time objectives (RTO) and recovery point objectives (RPO) tailored to clinical needs, and mission-critical systems should incorporate failover mechanisms to maintain access if the primary server fails.

Patient data within DP systems is subject to strict privacy regulations, including the HIPAA in the United States and the GDPR in Europe. Compliance requires a combination of technical, administrative, and physical safeguards. Technical measures include encryption of stored images (data at rest) and secure transmission channels (data in motion). Administrative controls involve access policies, staff training, and periodic security audits. Physical protections cover server room access restrictions, environmental controls, and redundant power and network systems. Institutions must also establish protocols for image anonymization and de-identification, particularly when sharing data for research or AI development. Metadata that could inadvertently identify patients—such as accession numbers, dates, or annotations—should be removed prior to export. Automated anonymization workflows help prevent data breaches while enabling ethical secondary use of DP data.

Data retention and archiving

Regulatory requirements determine how long pathology materials must be kept. In the U.S., the College of American Pathologists (CAP) mandates a minimum 10-year retention for glass slides and blocks[13], but no formal guideline yet exists for digital Whole slide images (WSIs). As a result, institutions set their own policies: some delete images after 3–6 months, while others keep them much longer to support research and innovation.

As DP adoption grows, managing the balance between image accessibility and storage costs has become increasingly important, with storage expenses rising annually. Effective WSI life cycle management (WSI-LCM) policies are therefore essential. Key considerations include the purpose of retention, required duration, access speed, and storage cost. Common retention strategies include: (1) Clinical diagnostic cases: retained for up to 3 months; require rapid, high-performance storage. After sign-out, these cases can be migrated to secondary storage. (2) Prior cases after sign-out: retention varies by subspecialty, averaging 2–3 years; moderate access times are acceptable with slower, cost-effective storage. (3) Educational cases: retained indefinitely with fast access to support teaching needs. (4) Legal/regulatory cases: often stored for up to 10 years; rarely accessed, suitable for slow, low-cost archival storage.

Network architecture and performance

The transition to DP significantly increases network demands. WSIs, even when compressed, are substantially larger than typical clinical data files, requiring networks with sufficient bandwidth, low latency, and high reliability. Bandwidth needs depend on case volume, image size, and the number of concurrent users. As a guideline, institutions should provide at least 1 Gbps connections between scanners, storage servers, and pathologist workstations, with 10 Gbps backbones preferred in high-volume settings. Network performance should be evaluated under peak load to identify potential bottlenecks.

Even minor latency can impact pathologist’s productivity. To address this, the IMS often employ local caching, temporarily storing recently accessed images on workstations or local servers for near-instantaneous reload during viewing sessions. Network optimization techniques—such as content delivery networks (CDNs) or distributed caching—can further enhance performance across geographically dispersed campuses. Because DP involves protected health information (PHI), network security must meet strict regulatory standards. Access controls ensure that only authorized personnel can view, annotate, or modify images. Role-based access control (RBAC) assigns permissions according to user roles, while audit trails log every access event to support accountability and compliance.

For remote access, secure virtual private networks (VPNs) or institution-managed web portals are used. Mandatory safeguards include multi-factor authentication, encryption in transit (HTTPS/TLS), and data integrity verification.

Diagnostic workstations

The quality of the viewing workstation directly affects the diagnostic experience. Pathologists transitioning from microscopes to digital monitors require high-resolution, color-calibrated displays that accurately reproduce histologic detail. A fully digital workflow also necessitates dedicated diagnostic displays at each pathologist’s workstation. DP displays can be categorized as medical grade (MG), professional grade (PG), or consumer off-the-shelf (COTS). Recently, pathology-specific MG displays have received FDA approval, and preliminary studies have benchmarked these instruments for primary diagnosis. Other studies have compared the performance of PG and COTS displays against MG displays[14–17].

Diagnostic monitors generally require at least 27-inch screens with 4K resolution or higher, providing a sufficient field of view for efficient navigation. Color calibration should meet medical imaging standards, and luminance should remain consistent across viewing sessions. Some institutions implement dual- or multi-monitor setups, dedicating one screen for image viewing and another for the LIS or report interface to enhance multitasking efficiency. Digital sign-out also changes the physical posture of pathologists, shifting from microscope-based to screen-based work. Proper ergonomic arrangements including adjustable monitor height, supportive chairs, and optimal lighting—help reduce fatigue and prevent musculoskeletal strain. Training in ergonomic practices should be incorporated into onboarding for all digital users[18].

Annotation tablets, programmable keyboards, and high-precision mice can enhance navigation speed and user comfort. Some pathologists prefer trackballs or touch interfaces for faster panning and zooming[19]. Institutions should accommodate individual preferences to optimize productivity and user satisfaction[20].

Information technology (IT) support, maintenance, and lifecycle management

DP systems demand ongoing IT support well beyond traditional histology operations. Dedicated IT personnel—ideally embedded within the pathology department—should manage hardware maintenance, software updates, and user support[11]. Routine tasks include monitoring scanner performance and updating firmware, verifying backups and storage integrity, applying security patches and antivirus updates, and reviewing system logs for errors or performance issues. Lifecycle management encompasses planning hardware refresh cycles (typically every 3–5 years), ensuring compatibility between legacy and new systems, and budgeting for software upgrades. A long-term maintenance roadmap helps prevent sudden obsolescence and ensures consistent user experience.

Clinical implementation of digital workflow

Although infrastructure and technology provide the foundation for DP, true success relies on seamless integration into daily laboratory workflows. This requires re-engineering traditional processes to include slide scanning, digital case assignment, and electronic sign-out, ensuring that digitization improves diagnostic efficiency without disruption[9,21–25].

DP workflows can be broadly classified into two models: partial digital implementation, in which only selected subspecialties (for example, breast pathology, gastrointestinal pathology or genitourinary pathology) or use cases are digitized (for example, consultation, frozen sections, or education); and full digital implementation, in which all routine diagnostic slides are scanned and reviewed digitally. Both models offer unique advantages and challenges. Partial implementation allows laboratories to pilot the technology, build confidence, and identify bottlenecks before large-scale deployment. Full digital adoption, while more ambitious, enables seamless integration with other digital health systems and unlocks efficiencies in workload distribution, telepathology, and computational analytics.

Institutions pursue DP for diverse reasons. Many aim to enhance diagnostic efficiency, reduce turnaround times, and support remote sign-out. Others seek to improve patient safety through standardized workflows and traceable digital records. Additionally, DP provides a foundation for future innovations, including AI-assisted diagnostics, computational image analysis, and precision oncology. Regardless of the initial motivation, successful implementation requires alignment among multiple stakeholders, including pathologists, laboratory managers, IT teams, hospital administrators, and regulatory authorities.

Historically, pathology has lagged behind other medical disciplines in digital transformation, largely due to the technical and logistical challenges of digitizing glass slides at diagnostic resolution. Nevertheless, several landmark institutional initiatives have shown that DP is both feasible and beneficial when carefully planned and executed[11,18,26–37]. Large academic institutions have reported measurable gains in workflow efficiency, cost savings through reduced slide handling and courier services, and improvements in multidisciplinary collaboration[11,12,38–40]. These experiences offer invaluable insights into the operational and strategic considerations necessary for successful adoption. Over more than two decades of published work, early adopters have demonstrated that DP is a transformative yet inherently complex endeavor. Key findings from these pioneers are summarized in Table 2.

A key lesson from early adopters is the critical importance of infrastructure readiness. Implementing a DP workflow requires a robust network capable of handling terabyte-scale data, secure and redundant storage, and scalable image management software that integrates seamlessly with existing LISs. Equally essential is clinical validation to ensure diagnostic equivalence between digital and optical modalities, supported by rigorous QA protocols, standard operating procedures, and compliance with guidelines from professional bodies such as the CAP, the Digital Pathology Association (DPA), and relevant international agencies[24,41–44].

Equally important is the human factor. Digital transformation reshapes how pathologists work, communicate, and interact with their environment. Training programs must address both the technical use of digital systems and the cognitive adaptation required for interpreting digital images. User acceptance, workflow redesign, and sustained leadership support are critical to success. Institutions that have achieved smooth transitions often emphasize early and continuous engagement of pathologists in planning, system selection, and validation.

DP should also be viewed as a long-term investment rather than a one-time project. Beyond initial hardware and software acquisition, ongoing costs include system maintenance, data storage expansion, software updates, and cybersecurity. A comprehensive business case is essential to secure institutional funding and demonstrate the economic value of DP, whether through direct operational savings or indirect benefits such as improved efficiency, faster turnaround times, and enhanced academic and research capacity.

A comprehensive roadmap for implementing a digital workflow in clinical surgical pathology is illustrated in Fig. 1. Based on institutional case studies and published evidence, it outlines the technical, operational, and human dimensions of digital adoption. Each section addresses a core domain of implementation, from strategic planning to technical infrastructure, workflow design, system integration, validation, staffing, training, adoption, and continuous quality improvement. This roadmap provides a practical framework adaptable to institutions of varying size, complexity, and readiness, helping ensure that DP delivers sustainable improvements in diagnostic practice.

Designing a digital slide workflow

The quality of a digital image starts with the glass slide. Slides must be clean, uniformly coverslipped, and accurately labeled or barcoded, as even minor imperfections—such as excess mounting medium, debris, or air bubbles—can distort scanning. Barcode labeling is critical: each slide should carry a 2D barcode encoding the accession number and slide ID to enable automatic linkage to the case record. Standardizing label placement ensures reliable recognition by scanner cameras. Modern histology laboratories often integrate immunohistochemistry (IHC) staining into routine workflows. Ideally, whole-slide scanners are positioned between the H&E and IHC areas so slides can be scanned immediately after coverslipping. This placement minimizes manual handling, reduces the risk of loss or damage, and ensures temporal alignment between physical and digital slides. In high-throughput settings, dedicating scanning personnel helps maintain continuous operation, rapid troubleshooting, and consistent QC[11,12,29].

Slides are typically loaded into racks or cassettes, verified by barcode, and automatically queued for scanning. Modern scanners include automated tissue detection, focus mapping, and calibration algorithms, but technician oversight remains essential. All scanned fields should be reviewed to ensure proper focus and image fidelity. Key QC checkpoints include focus accuracy, scan completeness, absence of scanning or staining artifacts, and appropriate image color balance.

The IMS assigns a unique digital identifier to each image, linking it to metadata such as case number, tissue type, stain, and block or slide ID. Consistent naming conventions are critical to prevent mismatches, and ideally, metadata is imported directly from the LIS via Health Level Seven (HL7) interfaces to avoid manual entry errors.

After scanning, digital files are automatically uploaded to the IMS, where checksum verification confirms file integrity, followed by thumbnail generation and indexing. In high-volume laboratories, this process is parallelized across multiple scanners and servers. Some labs also employ automated dashboards that display scanning progress, error rates, and throughput metrics in real time, enabling supervisors to quickly identify bottlenecks, such as scanner downtime or poor slide quality.

LIS-IMS synchronization

Integration between the LIS and the IMS forms the backbone of a digital workflow. Without reliable synchronization, pathologists cannot seamlessly access digital slides within their case worklists. When a case is accessioned in the LIS, a unique case ID is generated. During scanning, barcode data link each digital image to this case ID, enabling automatic association. Once ingestion is complete, the IMS sends a confirmation message to the LIS, indicating that the slides are available for review. In a fully integrated environment, the pathologist can open the case in the LIS and launch the digital viewer directly, which loads all corresponding slides in the correct order with the appropriate stain identifiers[11,12,31]. Two-way communication between the LIS and IMS enables real-time updates on case status, such as “in scan”, “available”, or “awaiting QC”. This transparency supports efficient workload management and helps prevent premature case assignment.

Hybrid workflows

Few institutions transition immediately to full digital operations; most begin with hybrid workflows that combine glass and digital review. Hybrid approaches commonly include: (1) pilot subspecialties, where digital workflows are initially implemented in areas that benefit most, such as dermatopathology or gastrointestinal pathology; (2) remote consultations, using digital slides for second opinions while maintaining glass archives; and (3) academic or training deployments, introducing digital systems in educational settings before diagnostic rollout[11,12,18,22,27–32,35,37,45,46].

Each scanning batch should undergo systematic QC inspection. This includes verifying focus, color fidelity, completeness of tissue capture, and absence of artifacts. Institutions often designate dedicated QC personnel who review thumbnail grids or random samples[47–49].

Validation and QA

Validation and QA are essential for DP. Unlike traditional microscopy, digital workflows introduce variables including scanners, image formats, displays, network performance, and software that can affect diagnostic accuracy. Robust validation and ongoing QA ensure that WSIs faithfully replicate optical review, support regulatory compliance, and maintain diagnostic confidence.

Validation

Validation in DP covers both technical and clinical performance. Technical validation evaluates hardware and software, including image fidelity, scanner calibration, focus accuracy, color reproduction, tissue coverage, and scanning speed. Test slides representing typical tissue types, staining variations, and known artifacts are scanned, and deficiencies prompt calibration, software adjustment, or protocol modification. Repeated scans under varying conditions (operator, scanner, or time of day) should yield consistent results[27,30,42,49,50].

Clinical validation ensures pathologists can render accurate diagnoses on WSIs compared to glass slides. Studies should include the full spectrum of routine cases, with subspecialty-specific validation as needed. Pathologists independently review cases in both digital and glass formats, with concordance rates calculated to confirm diagnostic equivalence. Systematic discrepancies must be addressed, and a minimum washout period between readings is recommended. For telepathology or remote sign-out, validation should replicate expected network conditions and display setups to maintain accuracy across locations.

Regulatory and professional guidelines shape the validation process. Institutions typically refer to standards set forth by bodies such as the CAP in the US (Table 3), the Royal College of Pathologists (RCPath) in the UK or other regional authorities/accreditation agencies[41,42].

QA

QA is essential to maintaining diagnostic integrity in DP. Digitization introduces variables absent in conventional microscopy, requiring systematic monitoring of image quality. Technologists and pathologists perform daily or batch-level inspections to detect focus errors, blurring, color distortions, tissue truncation, and missing sections. Barcode accuracy and metadata integrity are also verified to ensure proper case assignment and seamless LIS integration. Modern scanners and imaging platforms increasingly include automated QC algorithms that detect focus or registration deficiencies, allowing timely corrective action before review[47–49,51].

Monitoring rescan rates provide an important operational metric, as elevated rates may indicate issues in tissue processing, coverslipping, staining, scanner calibration, or operator technique. Display calibration is critical for accurate color, brightness, and contrast rendering, with photometric or colorimetric tools ensuring consistent viewing across workstations.

QA extends to DP software and systems, including functional validation, regression testing, bug tracking, version control, and formal change-management policies to prevent workflow disruptions. Data integrity is supported by comprehensive audit trails documenting scanning times, operators, access logs, annotations, modifications, case assignments, and sign-out timestamps, enabling root-cause analysis when errors occur.

A structured error-management framework addresses inevitable scanning, data handling, or interpretive errors. This includes prompt identification via automated alerts or QC checkpoints, classification of errors, corrective actions such as rescanning or retraining, and preventive strategies guided by trend analysis.

Ongoing performance monitoring using quantitative metrics—diagnostic concordance, scanner uptime, image rejection/rescan rates, digital turnaround time, and user satisfaction—ensures that DP systems continue to meet clinical, operational, and regulatory standards.

Clinical adoption

Resistance to change is a common barrier in digital transformation. Pathologists and laboratory staff accustomed to glass-slide practice may question digital accuracy or workflow efficiency, making effective change management essential. Institutions should engage stakeholders early, communicate benefits such as faster turnaround, remote access, and AI integration, and use pilot projects to demonstrate tangible improvements.

Leadership fosters adoption by recognizing early adopters, sharing successes, celebrating milestones, and providing channels for feedback. Structured strategies—early engagement, transparent communication, targeted training, pilot projects, and “digital champions” mentoring colleagues—further support acceptance.

Most institutions implement DP in phases[11,12,29]:

•Pilot phase: Limited deployment, often in one subspecialty or consultation cases, to assess technology, training, and workflow impact.

•Hybrid phase: Expanded scanning for selected routine cases, combining glass and digital workflows, identifying bottlenecks, validating systems, and refining standard operating procedures (SOPs).

•Full digital phase: Transition all diagnostic cases to digital, supported by established QC measures, IT infrastructure, and trained staff.

Each phase includes performance monitoring, staff feedback, and iterative improvement, reducing risk and building confidence. Adoption can be evaluated using metrics such as digital case volume, turnaround time, user satisfaction, costs, rescan rates, diagnostic concordance, and QC compliance, ensuring alignment with institutional goals and guiding optimization.

AI in pathology

Introduction of AI in pathology

AI is rapidly transforming diagnostic pathology[52,53]. As a discipline grounded in interpreting complex tissue morphology, pathology has traditionally relied on manual expertise developed over years of training. Increasing case complexity, the rise of precision medicine, and a global workforce shortage now create a strong need for computational support. AI addresses this gap by performing pattern recognition, decision-making, and learning tasks at scale and speed[52,54].

Unlike structured data (e.g., laboratory values), histopathology images are inherently unstructured. WSIs consist of millions of pixels without inherent semantic meaning—computers “see” only pixel arrays, not biological entities such as nuclei or tumors. This lack of structure limits direct computational interpretation. To enable analysis, visual patterns must be translated into quantitative features[55]. This involves segmentation (identifying regions), feature extraction (e.g., nuclear size, texture), and classification (assigning biological meaning). AI bridges this gap by converting pixel data into structured outputs—such as tumor grade, biomarker expression, or prognostic scores—thereby supporting automated diagnosis, risk prediction, and treatment planning.

Every pathology image begins as a grid of pixels—rich in color but devoid of meaning. While pathologists recognize nuclei, glands, and tumor architecture, computers see only numerical arrays. Bridging this gap requires a structured pipeline that converts raw pixels into clinically actionable insights.

Annotation and labeling provide the foundation. Expert pathologists define “ground truth” by marking key structures (e.g., tumor regions, mitoses), enabling supervised learning and linking biological knowledge to computational models[56]. Segmentation partitions images into meaningful regions—tumor, stroma, necrosis, lymphocytes—so that context-specific analysis becomes possible. This step underpins tasks such as tumor grading and biomarker quantification[57–65]. Registration aligns multi-modal images (e.g., H&E and IHC), ensuring that corresponding regions match across slides and enabling integrated analysis of morphology and molecular markers[66–69]. Color normalization reduces variability from staining and scanning differences, improving model robustness and consistency across datasets[70–73]. Feature extraction then converts visual patterns into quantitative representations. Traditional approaches rely on engineered features (e.g., nuclear size, texture), whereas DL learns hierarchical features directly from pixel data, often revealing patterns beyond human perception[74–76]. Finally, model outputs translate these features into structured results—tumor classification, biomarker levels, or prognostic predictions—supporting diagnosis and treatment decisions[77–88]. In essence, this pipeline transforms unstructured images into structured knowledge, enabling scalable, reproducible analysis and advancing precision pathology.

Overview of machine learning (ML) and DL

AI in pathology is driven primarily by two approaches. ML uses handcrafted features (e.g., nuclear size, texture, shape) defined by experts to train models on labeled data (Fig. 2A). DL uses neural networks to learn hierarchical features directly from raw pixels, often uncovering patterns beyond human perception. This distinction has practical implications: DL typically requires larger datasets, ML is more interpretable, and both must meet clinical expectations for transparency and reproducibility.

ML: handcrafted intelligence

ML learns from annotated datasets by converting images into predefined quantitative features. These features enable models to classify patterns—such as distinguishing mitotic from non-mitotic cells—based on measurable attributes. Because inputs are explicit and biologically meaningful, ML models are generally more interpretable and well-suited for tasks with defined criteria (e.g., tumor grading, biomarker quantification). Learning in ML is essentially mapping patterns to meaning. In supervised learning, models are trained on labeled examples (e.g., annotated slides), enabling accurate, task-specific predictions. In unsupervised learning, models identify inherent patterns without labels, supporting discoveries such as identifying novel histologic subgroups. Together, they balance reliability and innovation[89].

DL: automated feature discovery

DL represents a shift from manual feature design to data-driven learning. Neural networks trained on large image datasets automatically learn features associated with diagnostic patterns such as tumor architecture or mitotic activity without explicit instructions[90,91]. This enables detection of subtle, previously unrecognized signals but often at the cost of interpretability (“black box” behavior). Architecture represents network design (e.g., CNN, Vision Transformer) while a model is an architecture trained on data for a specific task. Common DL architectures in pathology include: (1) Convolutional Neural Networks (CNNs): detect local features; widely used for segmentation and classification[92,93]; (2) Vision Transformers (ViTs): capture global context for tasks like grading[94,95]; (3) Graph Neural Networks (GNNs): model cell–cell interactions and tissue architecture[96,97]; (4) Recurrent Neural Networks (RNNs)/Long Short-Term Memory networks (LSTMs): analyze sequential data (e.g., longitudinal samples, reports)[98,99]; and (5) multimodal models: integrate images with genomic and clinical data. (Fig. 2B)

DL models can achieve high diagnostic accuracy yet often lack transparent reasoning, so-called “black boxes”. In clinical settings, this is problematic: predictions must be explainable, defensible, and aligned with medical logic. Without interpretability, even accurate models risk limited trust and adoption.

This opacity stems from how DL works. Unlike traditional machine learning with explicit features (e.g., nuclear size, texture), DL learns abstract patterns from raw pixels. These features are encoded as numerical vectors without clear biological meaning, making it difficult to explain why a model predicts high grade or poor prognosis.

Explainability is therefore essential in pathology, where errors have direct clinical consequences and regulatory approval depends on transparency. To address this, several methods provide insight into model behavior: Saliency maps: highlight image regions influencing predictions[100]; Grad-CAM: generates heatmaps showing where the model “looked”[101,102]; Attention mechanisms: indicate which regions receive the most focus[85,103–105]. While not fully resolving interpretability, these tools help validate whether model decisions align with athology principles. Ultimately, explainability is a clinical necessity—enabling trust, regulatory acceptance, and effective collaboration between AI and pathologists.

Foundation models in pathology: the era of scalable intelligence

The emergence of foundation models marks a turning point in computational pathology. Traditionally, AI systems were developed for narrow tasks—such as mitosis detection, tumor grading, or biomarker quantification—each requiring dedicated datasets and training pipelines. In contrast, foundation models are trained on millions of image tiles from diverse WSIs, spanning tissue types and staining protocols. Models such as CTransPath, PLIP, UNI, Virchow, and CHIEF operate at an unprecedented scale, enabling them to learn generalizable representations of histopathology[106–112]. These representations act as universal building blocks that can be fine-tuned for specific tasks using minimal labeled data, substantially reducing the annotation burden that has long constrained AI development in pathology. Beyond scalability, foundation models offer strong adaptability. Through transfer learning, knowledge gained in one domain (e.g., breast cancer morphology) can accelerate learning in others, such as prostate cancer or rare sarcomas—an essential capability for the heterogeneous landscape of pathology. Moreover, multimodal models like PLIP extend this paradigm by integrating images with text, linking WSIs to pathology reports and clinical narratives. This convergence of visual and contextual data moves the field toward more holistic diagnostic systems and intelligent platforms that support precision medicine.

Current state of AI applications in clinical practice

To understand the practical impact of AI in pathology, imagine the daily challenges faced in a busy diagnostic lab. Each case brings unique complexities, some subtle, others glaring—and every decision carries clinical consequences. AI steps in as a digital assistant, not replacing the pathologist but amplifying their capabilities. Below are key areas where AI is making a difference, illustrated through real-world scenarios.

Classification and diagnosis

Precise classification of pathologic lesions is essential for appropriate treatment selection. AI is increasingly used to support primary diagnosis by analyzing complex morphologic patterns at scale. For example, in prostate biopsies, grading depends on glandular architecture—yet even experts may disagree in borderline cases. AI models trained on large annotated datasets can assess these patterns consistently, reducing variability. In breast pathology, CNNs analyze nuclear features and mitotic activity, achieving performance comparable to experienced pathologists and serving as a reliable second reader[75,82,113–117].

AI-based screening tools further enhance workflow efficiency. Systems such as the IBEX (Galen Breast) platform, trained on millions of labeled image patches, demonstrate high accuracy across lesion types while improving pathologist performance, reducing review time, and lowering unnecessary immunohistochemistry use[118,119]. Regulatory clearance of Paige Prostate Detect by the FDA represented a key milestone, accelerating the clinical deployment of AI for prostate cancer detection in core needle biopsies across the U.S. pathology practices[120–122]. Meanwhile, in Europe, CE-marked platforms—including Ibex’s Galen Prostate, Aiforia’s Prostate Cancer AI, and DeepDx—are seeing increasing adoption in routine laboratory workflows[123–126].

Overall, AI-assisted diagnostic classification has matured rapidly, with algorithms approaching or even exceeding expert-level performance in specific tasks. The greatest near-term utility lies in workflow augmentation, including triage of negative or high-probability malignant cases, pre-screening, and decision support for challenging diagnostic lesions.

Screening and detection

Lymph node metastasis

Detecting metastases in lymph nodes is critical for accurate staging in cancer, yet small tumor deposits can be easily missed. AI systems trained on large datasets can identify these subtle foci and flag suspicious regions, reducing false negatives and improving patient safety[80,83,85]. This challenge is especially relevant for axillary lymph nodes in breast cancer patients, where micrometastases and isolated tumor cells are difficult and time-consuming to detect. Studies show that AI significantly improves both sensitivity and efficiency. In the landmark CAMELYON16 challenge, top-performing algorithms achieved near-perfect accuracy (area under the curve [AUC] up to 0.994), in some cases exceeding the pathologist’s performance. AI assistance has also been shown to increase detection sensitivity and reduce interpretation time in clinical settings[127–130].

Commercial tools, such as the Visiopharm platform, now automate metastasis detection, measurement, and annotation on WSIs. These systems can achieve very high sensitivity while reducing review time, underscoring AI’s growing role in enhancing diagnostic accuracy and workflow efficiency[131]. Tools such as Paige’s PanCancer Detect and the Aiforia Colon Suite, are trained to recognize metastatic lesions in many tumors, including those arising from gastrointestinal, breast and other primaries[132,133].

Microcalcifications

Identifying mammary microcalcifications on H&E-stained stereotactic breast biopsy slides is challenging, as they are associated with a broad spectrum of benign, premalignant, and malignant conditions. In addition to detecting malignant and atypical lesions, the IBEX AI solution is designed to identify microcalcifications in breast biopsy WSIs. In one study, the algorithm achieved an AUC of 0.925 with 95% sensitivity, highlighting its potential to support microcalcification detection in routine practice[119].

Mitosis

Counting mitosis is a cornerstone of tumor grading, yet it is notoriously tedious. Scanning a large sarcoma or other tumor section for tiny mitotic spindles among thousands of cells can be exhausting, and fatigue or time pressure may cause figures to be missed. AI algorithms excel in this task, rapidly scanning entire slides and accurately pinpointing mitotic cells. This automation not only speeds workflow but also enhances reproducibility, which is crucial in high-stakes diagnoses[134–137].

AI-driven quantification of biomarkers and grading

IHC biomarkers

Tissue biomarkers are central to diagnosis, prognosis, and treatment selection. Traditionally, protein markers are evaluated by IHC and pathologists have historically interpreted these manually. With the advancement of DP, AI technology is now being used to provide more objective and standardized assessments. Key biomarkers in routine pathology practice, including estrogen receptor (ER), progesterone receptor (PR), human epidermal growth factor receptor 2 (HER2/neu), Ki-67, and programmed death-ligand 1 (PD-L1) have been the focus of many AI studies.

ER and PR are predictive of breast cancer prognosis and treatment response and are routinely assessed by IHC, typically as the percentage of positive tumor nuclei. Manual scoring is variable[138–146], but AI algorithms show strong agreement with pathologists and can calculate H-scores, improving reproducibility[12,147–155], though oversight remains necessary for faint staining or benign tissue[156].

HER2, overexpressed in about 20% of breast cancers and many other cancers such as endometrial serous carcinoma, gastrointestinal adenocarcinoma, and lung cancer, is assessed via IHC and/or in situ hybridization (ISH) per American Society of Clinical Oncology (ASCO)/CAP guidelines[157–160]. Manual HER2 scoring suffers from variability[161–163]. AI-based methods have demonstrated superior reproducibility and accuracy[164–169]. Algorithms evaluate membrane staining patterns or staining intensity, while AI-assisted microscopy incorporating augmented reality is further improving reliability[164,170–173]. Emerging research suggests that AI may help refine classification, such as distinguishing HER2-low subgroups[174–176].

Ki-67, a proliferation marker with prognostic significance in many tumors and in gastrointestinal neuroendocrine tumor grading[177–180], suffers from variable manual scoring[177,181–185]. AI can automate detection and positivity calculations across entire tumor areas for more consistent estimates[186–193].

PD-L1 IHC guides immune checkpoint therapy in lung cancer (Tumor Proportion Score [TPS]) and others (Combined Positive Score [CPS]), but scoring (TPS or CPS) is challenging manually. AI models can distinguish tumor and immune cells and compute TPS or CPS values matching expert assessments[194,195]. Emerging biomarkers, including multiplex panels and novel targets, are also poised to benefit from AI-based quantification, leveraging its ability to analyze complex staining patterns and spatial relationships.

Tumor grading

Consistent histologic grading in cancers is essential for prognosis and therapy planning. For example, the Nottingham system in breast cancer evaluates tubule formation, nuclear pleomorphism, and mitotic activity, but manual assessment is prone to interobserver variability. DL models improve reproducibility across all three components[37,196–206]. For instance, DL classifiers can automatically identify tubule-forming nuclei and calculate their ratio to total tumor cells, while automated pipelines detect mitotic hotspots and figures, enhancing proliferation assessment. Mitosis detection has been enhanced through automated DL pipelines that identify hotspots and mitotic figures, improving proliferation assessment accuracy. AI models for nuclear pleomorphism have also demonstrated superior reproducibility and prognostic stratification compared with traditional grading. In genitourinary pathology, AI achieves comparable performance with expert pathologists in grading prostate cancer (Gleason score) and clear cell renal cell carcinoma[207–215].

Tumor microenvironment

The tumor microenvironment, particularly tumor-infiltrating lymphocytes (TILs), is a strong prognostic factor in cancers, with high TIL levels linked to better survival and chemotherapy response[216,217]. Manual TIL assessment is subjective and variable, complicated by the complex spatial distribution of immune cells. DL algorithms improve detection and quantification, segmenting tumor versus stromal areas and calculating TIL density with high precision and reproducibility[218,219]. AI enables continuous scoring for finer stratification and systematic spatial profiling of immune cells, mapping their locations and interactions across entire slides. These spatial biomarkers provide additional prognostic and therapeutic insights, supporting research and clinical decision-making[220].

Prognosis, risk stratification, and prediction of treatment response

Perhaps the most exciting frontier is prediction. Imagine an H&E slide that not only confirms diagnosis but also forecasts a patient’s future—risk of recurrence, likelihood of therapy response, or underlying genetic alterations. AI models can bridge morphology and genomics, extracting subtle signals from tissue architecture to generate actionable prognostic insights. This approach transforms pathology from a descriptive discipline into a predictive science, fully aligning with the goals of precision medicine[77–82,84–88,117,221–224].

In breast cancer, histopathologic features contain rich prognostic information, traditionally assessed via histologic type, grade, tumor size, lymph node status, lymphovascular invasion, and biomarker expression[225–229]. Genomic assays like Oncotype DX, MammaPrint, and PAM50 guide adjuvant therapy decisions but are costly and not universally available. AI offers a scalable, cost-effective alternative by analyzing H&E slides to predict outcomes, recurrence risk, and molecular assay results. DL models, such as BCR-Net and Deep-ODX, can accurately score histologic features, correlate morphology with prognosis, and predict Oncotype DX scores with high consistency[84,86,200,225,227,230–235]. Commercial tools like Stratipath Breast (Stratipath AB) and RlapsRisk® BC (Owkin) demonstrate the clinical feasibility of AI-based risk stratification.

AI also enables prediction of treatment response by extracting features from tumor biology and the microenvironment. Multimodal approaches integrating histology with biomarkers, genomics, and imaging improve accuracy, supporting early therapy guidance[230,236–241].

AI applied to digital histopathology has emerged as a promising approach to infer molecular alterations directly from routine H&E slides, enabling “virtual molecular testing”. DL models can detect subtle morphologic correlations of molecular phenotypes, many of which are imperceptible to human observers. This capability has far-reaching implications for both research and clinical practice. By decoding complex biological patterns, AI is transforming breast cancer management toward more personalized, predictive care.

AI-integrated pathology workflow

Successful AI implementation in pathology requires strategic decisions about workflow. Automation opportunities include highly standardized tasks—grading, routine IHC quantifications, and straightforward frozen-section evaluations—where AI can improve efficiency, accuracy, and consistency. Processes requiring reinvention include QA, which may shift from retrospective audits to real-time AI-driven evaluation, and education and certification, which will increasingly embed AI competencies. Reports will become structured with AI-generated data, positioning pathologists to focus on verification, interpretation, and contextual integration. AI-generated summaries will enhance multidisciplinary communication. Human-centric roles to retain include integrated diagnostic reasoning, management of complex clinical scenarios, ethical decision-making, regulatory oversight, and patient-centered communication. Emerging capabilities include data-driven reports, advanced visualizations, predictive analytics, AI-driven case prioritization, and enhanced prognostic guidance.

Careful management of these transitions, with active pathologist involvement, ensures AI enriches rather than diminishes professional roles. Delegating routine tasks to AI allows pathologists to focus on complex cases, interdisciplinary collaboration, and clinical impact. Implementing AI demands structured planning, validation, and governance, guided by regulatory standards and stakeholder engagement. By following systematic approaches, pathology departments can transition to AI-enhanced practice while maintaining diagnostic excellence. Ultimately, the pathologist’s role evolves from diagnostician to diagnostic orchestrator, integrating diverse data into cohesive clinical narratives.

Ethics in AI adoption

Algorithmic bias in AI, particularly in pathology, largely arises from limitations in the datasets used for model training, including issues with sample selection, representation, and completeness of variables[242–244]. For example, datasets that disproportionately represent certain populations, such as adult white males due to accessibility and socioeconomic factors, can result in algorithms that do not generalize well to broader, more diverse patient groups, thereby potentially disadvantaging underrepresented populations. Another major concern is under-specification, where critical variables (such as genetic background or social determinants of health) are not included in the training data, leading to incomplete models and potentially misleading correlations in outcome prediction. This can result in erroneous assumptions, such as interpreting lower healthcare spending as an indicator of better health, when it may instead reflect limited access to care. These challenges underscore the importance of thoughtful dataset design and a deep understanding of both clinical and contextual factors when developing AI tools. Although it is not always practical for busy pathologists to master the complexities of statistical and algorithmic bias, maintaining awareness of these risks is essential when interpreting AI-assisted results. Addressing these issues requires a shared responsibility: regulatory agencies such as the U.S. Food and Drug Administration play a central role in evaluating and approving AI systems, while researchers, industry vendors, and pathologists must collaborate to identify, mitigate, and monitor sources of bias over time. In addition, professional organizations like the DPA and the CAP are actively contributing to guidance and best practices, helping ensure that AI is implemented in a way that is equitable, reliable, and clinically meaningful.

Patient privacy is a fundamental principle in the U.S. healthcare, centered on safeguarding PHI under the Health Insurance Portability and Accountability Act[245,246]. HIPAA establishes national standards for protecting PHI, grants patients’ rights over their health data, regulates electronic healthcare transactions, and enforces compliance through its privacy and security rules, overseen by the HHS Office for Civil Rights[245–248]. As AI technologies become increasingly integrated into healthcare and pathology, practitioners, especially those involved in decision-making, must maintain a working understanding of these regulatory requirements to ensure compliance and protect patient data.

New frontiers of AI in pathology

Multimodal foundation model

Multimodal foundation models represent a new paradigm in AI, extending beyond traditional large language models (LLMs) and image-based models. Trained on large, multimodal datasets including images, text, genomics, and clinical records, they learn generalizable representations that can be adapted to diverse tasks with minimal labeled data. Their architecture, typically built on large transformer backbones, enables integration and reasoning across multiple data modalities, particularly valuable in pathology, where diagnosis depends on the convergence of morphological, molecular, and multi-omics clinical data. This multimodal reasoning capability enables more comprehensive decision support, for example, correlating immunohistochemistry with morphological patterns, predicting genomic alterations from routine H&E slides, and generating differential diagnoses informed by clinical context.

The superiority of multimodal foundation models lies in four key areas: scalability and transferability, as pretraining on large, diverse datasets enables rapid adaptation with minimal labeled data; multimodal integration, allowing fusion of images, laboratory results, and clinical text for precision diagnostics; robustness and generalization, improving reliability across diverse populations and settings; and emergent reasoning, including in-context learning and cross-domain inference, which can generate novel biological insights and enhance clinical decision-making.

In practice, multimodal foundation models are poised to underpin next-generation pathology workflows, serving as the “central intelligence” layer. They can orchestrate image analysis, molecular prediction, automated reporting, and communication with clinicians and patients. By combining the narrative fluency of LLMs, the generative capabilities of diffusion models, and large-scale multimodal integration, they enable a shift from siloed analyses to integrated, patient-centered precision pathology, transforming data from images and clinical sources into diagnostic, prognostic, and predictive insights. Several emerging models illustrate this potential. PathChat applies LLM capabilities to WSIs, enabling interactive queries that link histopathologic findings with natural language explanations for education and decision support[249]. Virchow, a large vision foundation model trained on pathology images, demonstrates strong zero- and few-shot performance in cancer classification, survival prediction, and biomarker discovery[250]. Other models, such as CONCH[251] and PLIP[252], extend this paradigm through cross-modal representation learning that aligns images with text, enhancing retrieval and multimodal reasoning. The clinical utility of these models can be substantial. A pathologist could use a multimodal model as a “copilot,” integrating pathologic images with relevant clinical history to generate a tailored differential diagnosis and recommend appropriate ancillary tests to reach a final diagnosis.

Together, these advances show that foundation models are not only enhancing the accuracy and efficiency of diagnostic pathology but also enabling new workflows—such as natural language–driven exploration of whole-slide images, automated biomarker detection, and seamless integration with genomics and electronic health records[249–251,253–256]. These capabilities lay the groundwork for scalable, generalizable, and clinically deployable AI systems in precision pathology.

Agentic AI

While multimodal foundation models have been transformative, the next paradigm shift is the transition from reactive copilot to proactive, autonomous AI agent[257–259]. Agentic AI is defined by true agency: the ability to understand context, invoke appropriate tools, formulate complex goals, execute multi-step tasks through an iterative reasoning loop, and continuously monitor outcomes[260]. In healthcare, this could extend beyond answering queries to autonomously coordinating tasks such as test ordering or workflow management from a single high-level instruction[258,259]. In pathology, this shift marks a critical inflection point: from task-specific, assistive algorithms to fully agentic systems capable of orchestrating complex diagnostic workflows. By integrating modular architectures, persistent memory for gigapixel-scale WSI images, clinically grounded reasoning, agentic AI moves beyond passive detection to goal-directed execution. This evolution has the potential to redefine the pathologist’s role from manual analyst to high-level director, overseeing intelligent, adaptive, and end-to-end diagnostic processes.

Ferber et al. recently demonstrated an autonomous AI agent combining GPT-4 with precision oncology tools—ViTs for histopathology, MedSAM for radiology, and web-based resources (OncoKB, PubMed, Google). Across 20 multimodal cases, the system achieved 87.5% tool-use accuracy, 91.0% correct clinical conclusions, and 75.5% guideline citation accuracy—substantially outperforming GPT-4 alone (30.3% overall). This underscores the potential of integrated language model–driven systems to enhance personalized oncology decision support[261]. Similarly, SlideSeek exemplifies multi-agent AI in pathology[262]. Designed to analyze gigapixel WSIs autonomously, it mimics hierarchical human diagnostic reasoning: a Supervisor Agent formulates hypotheses and plans, while Explorer Agents use the PathChat multimodal model to examine slide regions. Iteratively refining its analysis, SlideSeek achieved 80.0% primary diagnosis accuracy on the DDxBench differential diagnosis benchmark, rivaling human-assisted systems that rely on pre-selected regions of interest (ROIs). This shift from reactive copilot to proactive, autonomous partner redefines the pathologist’s role from primary diagnostician to AI supervisor, with implications for training, workflow, and liability.

A defining advantage of agentic AI is its capacity to co-evolve with the pathologist[263]. Within a “pathologist-in-the-loop” model, the pathologist serves as an active supervisor rather than a passive end user. Corrections can be captured through active learning and converted into labeled training data. With lightweight fine-tuning, the system can quickly adapt to local laboratory practices or rare pathologic changes. This creates a continuous learning ecosystem in which AI performance improves alongside clinical expertise: the AI performs the computational heavy lifting, while the pathologist provides interpretive oversight and final diagnostic authority[263,264].

To ensure clinical safety, agentic AI must evolve beyond opaque “black box” outputs toward deliberate, self-evaluative reasoning. Pathology agents can implement an “Observe–Reflect–Refine” cycle, systematically reassessing intermediate outputs before finalizing a diagnosis. This will result in a transparent, “glass-box” system, which can be firmly anchored in authoritative clinical standards.

As AI transitions from a “co-pilot” to a true autonomous agent, accountability becomes paramount. Deploying agentic systems requires strong governance frameworks to ensure that greater autonomy does not dilute clinical responsibility. Large-scale data infrastructures must adhere to strict consent and privacy standards, while algorithmic bias should be actively monitored and mitigated through built-in reflective mechanisms.

The future of pathology lies in autonomous yet tightly integrated systems that augment, rather than replace, human expertise. By assuming routine, repetitive, and computationally intensive tasks, from ordering IHCs and quantifying biomarkers to drafting a diagnostic report, agentic AI can reduce workload and cognitive strain. In doing so, it reinforces the pathologist’s central role, enabling greater focus on complex diagnostic reasoning, multidisciplinary collaboration, and the delivery of precision care.

Conclusions

Pathology is entering a new era defined by the convergence of digital infrastructure, advanced analytics, and increasingly intelligent AI systems. The transition from glass slides to fully integrated digital workflows has not only improved efficiency and accessibility but has also expanded the scope of diagnostic insight through computational and multimodal approaches. Emerging technologies including multimodal foundation models and agentic AI further extend this trajectory, enabling more adaptive, context-aware, and scalable solutions that align closely with the goals of precision medicine.

Importantly, this transformation is not about replacing the pathologist, but about redefining and elevating their role. As these tools mature, the pathologist remains central: providing clinical judgment, oversight, and integration of complex data into meaningful patient care decisions. Realizing the full potential of digital and AI-enabled pathology will require thoughtful implementation, robust validation, and strong governance frameworks to ensure safety, transparency, and equity. Ultimately, the future of pathology lies in a synergistic partnership between human expertise and intelligent systems: one that enhances diagnostic accuracy, streamlines workflows, and advances personalized medicine on a global scale.

References

Publishing order | Descend order by publishing year | Descend order by cited within

[1]	Soliman J, Weiser K, Ahmed I, et al. Selecting high-throughput scanners for clinical use: a multicenter institution experience. Am J Clin Pathol. 2025;164(4):589-599.

[2]	Patel A, Balis UGJ, Cheng J, et al. Contemporary whole slide imaging devices and their applications within the modern pathology department: a selected hardware review. J Pathol Inform. 2021;12:50.

[3]	Clunie D, Hosseinzadeh D, Wintell M, et al. Digital imaging and communications in medicine whole slide imaging connectathon at digital pathology association pathology visions 2017. J Pathol Inform. 2018;9:6.

[4]	Kuzmak PM, Dayhoff RE. The use of digital imaging and communications in medicine (DICOM) in the integration of imaging into the electronic patient record at the Department of Veterans Affairs. J Digit Imaging. 2000;13(Suppl 1):133-137.

[5]	Singh R, Chubb L, Pantanowitz L, Parwani A. Standardization in digital pathology: supplement 145 of the DICOM standards. J Pathol Inform. 2011;2:23.

[6]	Inoue T, Yagi Y. Color standardization and optimization in whole slide imaging. Clin Diagn Pathol. 2020;4(1):10.15761/cdp.1000139.

[7]	Clarke EL, Treanor D. Colour in digital pathology: a review. Histopathology. 2017;70(2):153-163.

[8]	Bautista PA, Hashimoto N, Yagi Y. Color standardization in whole slide imaging using a color calibration slide. J Pathol Inform. 2014;5(1):4.

[9]	Fraggetta F, Caputo A, Guglielmino R, Pellegrino MG, Runza G, L’Imperio V. A survival guide for the rapid transition to a fully digital workflow: the “caltagirone example”. Diagnostics (Basel). 2021;11(10):1916.

[10]	Lianas L, Del Rio M, Pireddu L, et al. An open-source platform for structured annotation and computational workflows in digital pathology research. Sci Rep. 2025;15(1):28910.

[11]	Clarke B, Carment-Baker C, Bruce C, Hanna K, Yousef GM. Large scale implementation of DP for clinical diagnoses: experience, challenges, and lessons learned. Crit Rev Clin Lab Sci. 2026;63(2):109-123.

[12]	Lujan GM, Savage J, Shana’ah A, et al. Digital pathology initiatives and experience of a large academic institution during the coronavirus disease 2019 (COVID-19) pandemic. Arch Pathol Lab Med. 2021;145(9):1051-1061.

[13]	Bauer TW, Schoenfield L, Slaw RJ, Yerian L, Sun Z, Henricks WH. Validation of whole slide imaging for primary diagnosis in surgical pathology. Arch Pathol Lab Med. 2013;137(4):518-524.

[14]	Park S, Pantanowitz L, Parwani AV. Digital imaging in pathology. Clin Lab Med. 2012;32(4):557-584.

[15]	Abel JT, Ouillette P, Williams CL, et al. Display characteristics and their impact on digital pathology: a current review of pathologists’ future “microscope”. J Pathol Inform. 2020;11:23.

[16]	Clarke EL, Munnings C, Williams B, Brettle D, Treanor D. Display evaluation for primary diagnosis using digital pathology. J Med Imaging (Bellingham). 2020;7(2):027501.

[17]	Cazzaniga G, Mascadri F, Marletta S, et al. Benchmarking digital displays (monitors) for histological diagnoses: the nephropathology use case. J Clin Pathol. 2025;78(11):798-804.

[18]	Stathonikos N, Nguyen TQ, Spoto CP, Verdaasdonk MAM, van Diest PJ. Being fully digital: perspective of a Dutch academic pathology laboratory. Histopathology. 2019;75(5):621-635.

[19]	Alcaraz-Mateos E, Hernández-Gómez R, Rojas Calvente E, et al. Comparison of muscle activity while using different input devices in digital pathology. Rev Esp Patol. 2022;55(1):19-25.

[20]	Rogers J, Vedaraju Y, Hsu J, Kinskey J, Long SW, Christensen P. A comparative usability assessment of computer input devices for navigating digital whole slide images. J Pathol Inform. 2025;18:100449.

[21]	Ameisen D, Deroulers C, Perrier V, et al. Towards better digital pathology workflows: programming libraries for high-speed sharpness assessment of whole slide images. Diagn Pathol. 2014;9(Suppl 1):S3.

[22]	Fabián O, Švajdler M, Jirásek T. Integration of digital pathology workflow in the anatomic pathology laboratory. Cesk Patol. 2025;61(1):22-28.

[23]	Fraggetta F, Garozzo S, Zannoni GF, Pantanowitz L, Rossi ED. Routine digital pathology workflow: the Catania experience. J Pathol Inform. 2017;8:51.

[24]	Fraggetta F, L’Imperio V, Ameisen D, et al. Best practice recommendations for the implementation of a digital pathology workflow in the anatomic pathology laboratory by the European society of digital and integrative pathology (ESDIP). Diagnostics (Basel). 2021;11(11):2167.

[25]	Hartman DJ. Whole-slide imaging: clinical workflows and primary diagnosis. Adv Anat Pathol. 2020;27(4):236-240.

[26]	Stathonikos N, Veta M, Huisman A, van Diest PJ. Going fully digital: perspective of a Dutch academic pathology lab. J Pathol Inform. 2013;4:15.

[27]	Buck TP, Dilorio R, Havrilla L, O’Neill DG. Validation of a whole slide imaging system for primary diagnosis in surgical pathology: a community hospital experience. J Pathol Inform. 2014;5(1):43.

[28]	Thorstenson S, Molin J, Lundström C. Implementation of large-scale routine diagnostics using whole slide imaging in Sweden: digital pathology experiences 2006–2013. J Pathol Inform. 2014;5(1):14.

[29]	Ardon O, Reuter VE, Hameed M, et al. Digital pathology operations at an NYC tertiary cancer center during the first 4 months of COVID-19 pandemic response. Acad Pathol. 2021;8:23742895211010276.

[30]	Babawale M, Gunavardhan A, Walker J, et al. Verification and validation of digital pathology (whole slide imaging) for primary histopathological diagnosis: all Wales experience. J Pathol Inform. 2021;12:4.

[31]	Eloy C, Vale J, Curado M, et al. Digital pathology workflow implementation at IPATIMUP. Diagnostics (Basel). 2021;11(11):2111.

[32]	Temprana-Salvador J, López-García P, Castellví Vives J, et al. DigiPatICS: digital pathology transformation of the Catalan health institute network of 8 hospitals-planification, implementation, and preliminary results. Diagnostics (Basel). 2022;12(4):852.

[33]	Ferreira I, Montenegro CS, Coelho D, et al. Digital pathology implementation in a private laboratory: the CEDAP experience. J Pathol Inform. 2023;14:100180.

[34]	Daniel M, Nowak K, Vajpeyi R, et al. From microscopes to monitors: unique opportunities and challenges in digital pathology implementation in remote Canadian regions. Diagnostics (Basel). 2025;15(16):1983.

[35]	Iwuajoku V, Ekici K, Haas A, et al. An equivalency and efficiency study for one year digital pathology for clinical routine diagnostics in an accredited tertiary academic center. Virchows Arch. 2025;487(1):3-12.

[36]	Montezuma D, Monteiro A, Fraga J, et al. Digital pathology implementation in private practice: specific challenges and opportunities. Diagnostics (Basel). 2022;12(2):529.

[37]	Retamero JA, Aneiros-Fernandez J, Del Moral RG. Complete digital pathology for routine histopathology diagnosis in a multicenter hospital network. Arch Pathol Lab Med. 2020;144(2):221-228.

[38]	Ardon O, Asa SL, Lloyd MC, et al. Understanding the financial aspects of digital pathology: a dynamic customizable return on investment calculator for informed decision-making. J Pathol Inform. 2024;15:100376.

[39]	Lujan G, Quigley JC, Hartman D, et al. Dissecting the business case for adoption and implementation of digital pathology: a white paper from the Digital Pathology Association. J Pathol Inform. 2021;12:17.

[40]	Matias-Guiu X, Temprana-Salvador J, Garcia Lopez P, et al. Implementing digital pathology: qualitative and financial insights from eight leading European laboratories. Virchows Arch. 2025;487(4):815-826.

[41]

Evans AJ, Brown RW, Bui MM, et al. Validating whole slide imaging systems for diagnostic purposes in pathology: guideline update from the College of American Pathologists in collaboration with the American Society for Clinical Pathology and the Association for Pathology Informatics. Arch Pathol Lab Med. 2021;146(4):440-450.

[42]	Williams BJ, Treanor D. Practical guide to training and validation for primary diagnosis with digital pathology. J Clin Pathol. 2020;73(7):418-422.

[43]	Eloy C, Fraggetta F, van Diest PJ, et al. Digital transformation of pathology—the European Society of Pathology expert opinion paper. Virchows Arch. 2025;487(5):971-981.

[44]	Zarella MD, Bowman D, Aeffner F, et al. A practical guide to whole slide imaging: a white paper from the digital pathology association. Arch Pathol Lab Med. 2019;143(2):222-234.

[45]	Bruce C, Prassas I, Mokhtar M, et al. Transforming diagnostics: the implementation of digital pathology in clinical laboratories. Histopathology. 2024;85(2):207-214.

[46]	Flach RN, Fransen NL, Sonnen AFP, et al. Implementation of artificial intelligence in diagnostic practice as a next step after going digital: the UMC Utrecht perspective. Diagnostics (Basel). 2022;12(5):1042.

[47]	Ardon O, Labasin M, Friedlander M, et al. Quality management system in clinical digital pathology operations at a tertiary cancer center. Lab Invest. 2023;103(11):100246.

[48]	Chong Y, Bae JM, Kang DW, Kim G, Han HS. Development of quality assurance program for digital pathology by the Korean Society of Pathologists. J Pathol Transl Med. 2022;56(6):370-382.

[49]	Ho J, Parwani AV, Jukic DM, Yagi Y, Anthony L, Gilbertson JR. Use of whole slide imaging in surgical pathology quality assurance: design and pilot validation studies. Hum Pathol. 2006;37(3):322-331.

[50]	Hsu YR, Ahmed I, Phlamon J, et al. An adapted & improved validation protocol for digital pathology implementation. Semin Diagn Pathol. 2025;42(4):150905.

[51]	Kim YJ, Roh EH, Park S. A literature review of quality, costs, process-associated with digital pathology. J Exerc Rehabil. 2021;17(1):11-14.

[52]	Niazi MKK, Parwani AV, Gurcan MN. Digital pathology and artificial intelligence. Lancet Oncol. 2019;20(5):e253-e261.

[53]	Coudray N, Ocampo PS, Sakellaropoulos T, et al. Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning. Nat Med. 2018;24(10):1559-1567.

[54]	Campanella G, Hanna MG, Geneslaw L, et al. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat Med. 2019;25(8):1301-1309.

[55]	Senaras C, Niazi MKK, Lozanski G, Gurcan MN. DeepFocus: detection of out-of-focus regions in whole slide digital images using deep learning. PLoS One. 2018;13(10):e0205387.

[56]	Niazi MKK, Abas FS, Senaras C, et al. Nuclear IHC enumeration: a digital phantom to evaluate the performance of automated algorithms in digital pathology. PLoS One. 2018;13(5):e0196547.

[57]	Su Z, Wei Chen M, Sajjad U. Adapting segment anything model for tumor bud segmentation on hematoxylin and eosin images of colorectal cancer. Arch Pathol Lab Med. 2024;148(9):E183-E183.

[58]	Camalan S, Niazi MKK, Elmaraghy C, Moberly AC, Gurcan MN. Tympanic membrane segmentation of video frames to create composite images using SAM. In: Chen W, Astley SM, eds. Medical Imaging 2024: Computer-Aided Diagnosis. Vol 12927. SPIE; 2024.

[59]	Su Z, Chen W, Leigh PJ, et al. Few-shot tumor bud segmentation using generative model in colorectal carcinoma. In: Tomaszewski JE, Ward AD, eds. Medical Imaging 2024: Digital and Computational Pathology. Vol 12933. SPIE; 2024:129330A.

[60]	Su Z, Chen W, Annem S, et al. Adapting SAM to histopathology images for tumor bud segmentation in colorectal cancer. In: Tomaszewski JE, Ward AD, eds. Medical Imaging 2024: Digital and Computational Pathology. Vol 12933. SPIE; 2024:129330C.

[61]	Tavolara TE, Jorgensen AM, Gurcan MN, Murphy SV, Niazi MKK. Panoptic segmentation of wounds in a pig model. In: Mazurowski MA, Drukker K, eds. Medical Imaging 2021: Computer-Aided Diagnosis. Vol 11597. SPIE; 2021.

[62]	Binol H, Moberly AC, Niazi MKK, et al. SelectStitch: automated frame segmentation and stitching to create composite images from otoscope video clips. Appl Sci. 2020;10(17):5894.

[63]	Niazi MKK, Yazgan E, Tavolara TE, et al. Semantic segmentation to identify bladder layers from H&E Images. Diagn Pathol. 2020;15(1):87.

[64]	Tavolara TE, Niazi MKK, Beamer G, Gurcan MN. Segmentation of mycobacterium tuberculosis bacilli clusters from acid-fast stained lung biopsies: a deep learning approach. In: Proceedings of SPIE. Vol 11320. SPIE; 2020:113200E.

[65]	Niazi MKK, Yazgan E, Lee C, Parwani A, Gurcan MN. Identifying bladder layers from H and E images using U-Net image segmentation. In: Tomaszewski JE, Ward AD, eds. Medical Imaging 2020: Digital Pathology. Vol 11320. SPIE; 2020:1132006.

[66]	Tavolara TE, Niazi MKK, Ginese M, et al. Automatic discovery of clinically interpretable imaging biomarkers for Mycobacterium tuberculosis supersusceptibility using deep learning. EBioMedicine. 2020;62:103094.

[67]	Khan MK, Nyström I. A modified particle swarm optimization applied in image registration. In: 2010 20th International Conference on Pattern Recognition. IEEE; 2010:1302-2305.

[68]	Niazi M, Hedrich J, Ingela N. Image Registration using Particle swarm optimization approach. SSBA 2010; 2010.

[69]	Chen M, Ma H, Sun X, et al. Multimodal whole slide image processing pipeline for quantitative mapping of tissue architecture and tissue microenvironment. Npj Imaging. 2025;3:26.

[70]	Boschman J, Farahani H, Darbandsari A, et al. The utility of color normalization for AI-based diagnosis of hematoxylin and eosin-stained pathology images. J Pathol. 2022;256(1):15-24.

[71]	Salvi M, Branciforti F, Molinari F, Meiburger KM. Generative models for color normalization in digital pathology and dermatology: advancing the learning paradigm. Expert Syst Appl. 2024;245:123105.

[72]	Tellez D, Litjens G, Bándi P, et al. Quantifying the effects of data augmentation and stain color normalization in convolutional neural networks for computational pathology. Med Image Anal. 2019;58:101544.

[73]	Vahadane A, Peng T, Sethi A, et al. Structure-preserving color normalization and sparse stain separation for histological images. IEEE Trans Med Imaging. 2016;35(8):1962-1971.

[74]	Tavolara TE, Carreno-Galeano G, Gurcan MN, Lee SJ, Niazi MKK. Grading and localization of histological features for bioengineered kidney constructs. In: Medical Imaging 2021: Digital Pathology; 2021.

[75]	Niazi MKK, Keluo Yao, Zynger DL, et al. Visually meaningful histopathological features for automatic grading of prostate cancer. IEEE J Biomed Health Inform. 2017;21(4):1027-1038.

[76]	Lu MY, Williamson DFK, Chen TY, Chen RJ, Barbieri M, Mahmood F. Data-efficient and weakly supervised computational pathology on whole-slide images. Nat Biomed Eng. 2021;5(6):555-570.

[77]	Afzaal U, Su Z, Sajjad U, et al. HistoChat: instruction-tuning multimodal vision language assistant for colorectal histopathology on limited data. Patterns (N Y). 2025;6(8):101284.

[78]	Akbar AR, Sajjad U, Su Z, et al. CellEcoNet: decoding the cellular language of pathology with deep learning for invasive lung adenocarcinoma recurrence prediction. arXiv. Preprint posted online 2025. arXiv: 2508.16742.

[79]	Sajjad U, Akbar AR, Su Z, et al. Morphology-aware prognostic model for five-year survival prediction in colorectal cancer from H&E whole slide images. arXiv. Preprint posted online 2025. arXiv: 2510.14800.

[80]	Sajjad U, Rezapour M, Su Z, Tozbikian GH, Gurcan MN, Niazi MKK. NRK-ABMIL: subtle metastatic deposits detection for predicting lymph node metastasis in breast cancer whole-slide images. Cancers (Basel). 2023;15(13):3428.

[81]	She Y, Jin Z, Wu J, et al. Development and validation of a deep learning model for non-small cell lung cancer survival. JAMA Netw Open. 2020;3(6):e205842.

[82]	Su Z, Tavolara TE, Carreno-Galeano G, Lee SJ, Gurcan MN, Niazi MKK. Attention2majority: weak multiple instance learning for regenerative kidney grading on whole slide images. Med Image Anal. 2022;79:102462.

[83]	Su Z, Rezapour M, Sajjad U, Gurcan MN, Niazi MKK. Attention2Minority: a salient instance inference-based multiple instance learning for classifying small lesions in whole slide images. Comput Biol Med. 2023;167:107607.

[84]	Su Z, Niazi MKK, Tavolara TE, et al. BCR-Net: a deep learning framework to predict breast cancer recurrence from histopathology images. PLoS One. 2023;18(4):e0283562.

[85]	Su Z, Rezapour M, Sajjad U, Niu S, Gurcan MN, Niazi MKK. Cross-attention-based saliency inference for predicting cancer metastasis on whole slide images. IEEE J Biomed Health Inform. 2024;28(12):7206-7216.

[86]	Su Z, Rosen A, Wesolowski R, Tozbikian G, Niazi MKK, Gurcan MN. Deep-ODX: an efficient deep learning tool to risk stratify breast cancer patients from histopathology images. In: Medical Imaging 2024: Digital and Computational Pathology; 2024; San Diego, California, United States.

[87]	Su Z, Guo Y, Wesolowski R, et al. Computational pathology for accurate prediction of breast cancer recurrence: development and validation of a deep learning-based tool. Mod Pathol. 2025;38(12):100847.

[88]	Tavolara TE, Su Z, Gurcan MN, Niazi MKK. One label is all you need: interpretable AI-enhanced histopathology for oncology. Semin Cancer Biol. 2023;97:70-85.

[89]	Long X, Wang T, Kan Y, et al. Pseudo training data generation for unsupervised cell membrane segmentation in immunohistochemistry images. In: 2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM); 2024; Lisbon, Portugal: IEEE.

[90]	Bulten W, Kartasalo K, Chen PC, et al. Artificial intelligence for diagnosis and gleason grading of prostate cancer: the PANDA challenge. Nat Med. 2022;28(1):154-163.

[91]	Srinidhi CL, Ciga O, Martel AL. Deep neural network models for computational histopathology: a survey. Med Image Anal. 2021;67:101813.

[92]	Khosravi P, Kazemi E, Imielinski M, Elemento O, Hajirasouliha I. Deep convolutional neural networks enable discrimination of heterogeneous digital pathology images. EBioMedicine. 2018;27:317-328.

[93]	Dörrich M, Hecht M, Fietkau R, et al. Explainable convolutional neural networks for assessing head and neck cancer histopathology. Diagn Pathol. 2023;18(1):121.

[94]	Park SY, Ayana G, Wako BD, Jeong KC, Yoon SD, Choe SW. Vision transformers for low-quality histopathological images: a case study on squamous cell carcinoma margin classification. Diagnostics (Basel). 2025;15(3):260.

[95]	Wessels F, Schmitt M, Krieghoff-Henning E, et al. A self-supervised vision transformer to predict survival from histopathology in renal cell carcinoma. World J Urol. 2023;41(8):2233-2241.

[96]	Nair A, Arvidsson H, Gatica V JE, Tudzarovski N, Meinke K, Sugars RV. A graph neural network framework for mapping histological topology in oral mucosal tissue. BMC Bioinformatics. 2022;23(1):506.

[97]	Abbas SF, Vuong TTL, Kim K, Song B, Kwak JT. Multi-cell type and multi-level graph aggregation network for cancer grading in pathology images. Med Image Anal. 2023;90:102936.

[98]	Kim S, Lee E. A deep attention LSTM embedded aggregation network for multiple histopathological images. PLoS One. 2023;18(6):e0287301.

[99]	Pham TD. Time-frequency time-space long short-term memory networks for image classification of histopathological tissue. Sci Rep. 2021;11(1):13703.

[100]

Simonyan K, Vedaldi A, Zisserman A. Deep inside convolutional networks: visualising image classification models and saliency maps. arXiv. Preprint posted online 2013. arXiv: 1312.6034.

[101]

Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D. Grad-CAM: visual explanations from deep networks via gradient-based localization. Int J Comput Vis. 2020;128:336-359.

[102]

Zhang Z, Gu J, Chowdhury A, et al. Finer-CAM: spotting the difference reveals finer details for visual explanation. arXiv. Preprint posted online 2025. arXiv: 2501.11309.

[103]

Ilse M, Tomczak J, Welling M. Attention-based deep multiple instance learning. International conference on machine learning; 2018.

[104]

Mao J, Xu J, Tang X, et al. CAMIL: channel attention-based multiple instance learning for whole slide image classification. Bioinformatics. 2025;41(2):btaf024.

[105]

Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need. Adv Neural Inf Process Syst. 2017;30.

[106]

Su Z, Akbar AR, Sajjad U, Parwani AV, Niazi MKK. Streamline pathology foundation model by cross-magnification distillation. arXiv. Preprint posted online 2025. arXiv: 2509.23097.

[107]

Chen RJ, Ding T, Lu MY, et al. Towards a general-purpose foundation model for computational pathology. Nat Med. 2024;30(3):850-862.

[108]

Vorontsov E, Bozkurt A, Casson A, et al. Virchow: a million-slide digital pathology foundation model. arXiv. Preprint posted online 2023. arXiv: 2309.07778.

[109]

Alfasly S, Alabtah G, Hemati S, Kalari KR, Tizhoosh H. Zero-shot whole slide image retrieval in histopathology using embeddings of foundation models. arXiv. Preprint posted online 2024. arXiv: 2409.04631.

[110]

Wang X, Zhao J, Marostica E, et al. A pathology foundation model for cancer diagnosis and prognosis prediction. Nature. 2024;634(8035):970-978.

[111]

Xu H, Usuyama N, Bagga J, et al. A whole-slide foundation model for digital pathology from real-world data. Nature. 2024;630(8015):181-188.

[112]

Wang X, Yang S, Zhang J, et al. Transformer-based unsupervised contrastive learning for histopathological image classification. Med Image Anal. 2022;81:102559.

[113]

Goldstraw P, Chansky K, Crowley J, et al. The IASLC lung cancer staging project: proposals for revision of the TNM stage groupings in the forthcoming (eighth) edition of the TNM classification for lung cancer. J Thorac Oncol. 2016;11(1):39-51.

[114]

Borczuk AC. Updates in grading and invasion assessment in lung adenocarcinoma. Mod Pathol. 2022;35(Suppl 1):28-35.

[115]

Niazi MKK, Hemminger J, Kurt H, Lozanski G, Gurcan M. Grading vascularity from histopathological images based on traveling salesman distance and vessel size. SPIE Medical Imaging; 2014; San Diego, California, United States.

[116]

Lu H, Rezapour M, Baha H, Niazi MKK, Narayanan A, Gurcan MN. Gene Pointnet for tumor classification. Neural Comput Appl. 2024;36(33):21107-21121.

[117]

Wang D, Khosla A, Gargeya R, Irshad H, Beck AH. Deep learning for identifying metastatic breast cancer. arXiv. Preprint posted online 2016. arXiv: 1606.05718.

[118]

Sandbank J, Bataillon G, Nudelman A, et al. Validation and real-world clinical application of an artificial intelligence algorithm for breast cancer detection in biopsies. NPJ Breast Cancer. 2022;8(1):129.

[119]

Tahir M, Hu Y, Kumar H, et al. A comprehensive AI-based approach in classifying breast lesions: focusing on improving pathologists’ accuracy and efficiency. Clin Breast Cancer. 2025;25(6):e818-e825.

[120]

U.S. Food and Drug Administration. DEN200080: Paige prostate de novo classification order. Published September 21, 2021. Accessed May 19, 2026.

[121]

Paige. Introducing FDA-Approved Paige Prostate. Published December 21, 2022. Accessed May 25, 2025.

[122]

Smith A, Belanger EC. The Paige prostate suite: assistive artificial intelligence for prostate cancer diagnosis. Canadian Agency for Drugs and Technologies in Health; 2024:EH0123.

[123]

Eloy C, Marques A, Pinto J, et al. Artificial intelligence-assisted cancer diagnosis improves the efficiency of pathologists in prostatic biopsies. Virchows Arch. 2023;482(3):595-604.

[124]

Flach RN, van Dooijeweert C, Nguyen TQ, et al. Prospective clinical implementation of Paige Prostate Detect artificial intelligence assistance in the detection of prostate cancer in prostate biopsies: Confident P trial implementation of artificial intelligence assistance in prostate cancer detection. JCO Clin Cancer Inform. 2025;9:e2400193.

[125]

Chatrian A, Colling RT, Browning L, et al. Artificial intelligence for advance requesting of immunohistochemistry in diagnostically uncertain prostate biopsies. Mod Pathol. 2021;34(9):1780-1794.

[126]

Eloy C, Marques A, Pinto J, et al. Artificial intelligence-assisted cancer diagnosis improves the efficiency of pathologists in prostatic biopsies. Virchows Arch. 2023;482(3):595-604.

[127]

Liu Y, Kohlberger T, Norouzi M, et al. Artificial intelligence-based breast cancer nodal metastasis detection: insights into the black box for pathologists. Arch Pathol Lab Med. 2019;143(7):859-868.

[128]

Zheng X, Yao Z, Huang Y, et al. Deep learning radiomics can predict axillary lymph node status in early-stage breast cancer. Nat Commun. 2020;11(1):1236.

[129]

Steiner DF, MacDonald R, Liu Y, et al. Impact of deep learning assistance on the histopathologic review of lymph nodes for metastatic breast cancer. Am J Surg Pathol. 2018;42(12):1636-1646.

[130]

Ehteshami Bejnordi B, Veta M, Johannes van Diest P, et al. Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. JAMA. 2017;318(22):2199-2210.

[131]

Challa B, Tahir M, Hu Y, et al. Artificial intelligence-aided diagnosis of breast cancer lymph node metastasis on histologic slides in a digital workflow. Mod Pathol. 2023;36(8):100216.

[132]

Kenyon S, Sweeney BJ, Happel J, Marchilli GE, Weinstein B, Schneider D. Comparison of BD Surepath and ThinPrep Pap systems in the processing of mucus-rich specimens. Cancer Cytopathol. 2010;118(5):244-249.

[133]

Strander B, Andersson-Ellström A, Milsom I, Rådberg T, Ryd W. Liquid-based cytology versus conventional papanicolaou smear in an organized screening program : a prospective randomized study. Cancer. 2007;111(5):285-291.

[134]

Shen Z, Simard M, Brand D, et al. A deep learning framework deploying segment anything to detect pan-cancer mitotic figures from haematoxylin and eosin-stained slides. Commun Biol. 2024;7(1):1674.

[135]

Bertram CA, Aubreville M, Donovan TA, et al. Computer-assisted mitotic count using a deep learning-based algorithm improves interobserver reproducibility and accuracy. Vet Pathol. 2022;59(2):211-226.

[136]

Jahanifar M, Shephard A, Zamanitajeddin N, et al. Mitosis detection, fast and slow: robust and efficient detection of mitotic figures. Med Image Anal. 2024;94:103132.

[137]

Ganz J, Marzahl C, Ammeling J, et al. On the value of PHH3 for mitotic figure detection on H&E-stained images. arXiv. Preprint posted online 2024. arXiv: 2406.19899.

[138]

Nadji M, Gomez-Fernandez C, Ganjei-Azar P, Morales AR. Immunohistochemistry of estrogen and progesterone receptors reconsidered: experience with 5,993 breast cancers. Am J Clin Pathol. 2005;123(1):21-27.

[139]

Mann GB, Fahey VD, Feleppa F, Buchanan MR. Reliance on hormone receptor assays of surgical specimens may compromise outcome in patients with breast cancer. J Clin Oncol. 2005;23(22):5148-5154.

[140]

Hede K. Breast cancer testing scandal shines spotlight on black box of clinical laboratory testing. J Natl Cancer Inst. 2008;100(12):836-837, 844.

[141]

Collins LC, Botero ML, Schnitt SJ. Bimodal frequency distribution of estrogen receptor immunohistochemical staining results in breast cancer: an analysis of 825 cases. Am J Clin Pathol. 2005;123(1):16-20.

[142]

Badve SS, Baehner FL, Gray RP, et al. Estrogen- and progesterone-receptor status in ECOG 2197: comparison of immunohistochemistry by local and central laboratories and quantitative reverse transcription polymerase chain reaction by central laboratory. J Clin Oncol. 2008;26(15):2473-2481.

[143]

Ciocca DR, Elledge R. Molecular markers for predicting response to Tamoxifen in breast cancer patients. Endocrine. 2000;13(1):1-10.

[144]

Gelber RD, Gelber S; International Breast Cancer Study Group; Breast International Group. Facilitating consensus by examining patterns of treatment effects. Breast. 2009;18(Suppl 3):S2-S8.

[145]

Reisenbichler ES, Lester SC, Richardson AL, Dillon DA, Ly A, Brock JE. Interobserver concordance in implementing the 2010 ASCO/CAP recommendations for reporting ER in breast carcinomas: a demonstration of the difficulties of consistently reporting low levels of ER expression by manual quantification. Am J Clin Pathol. 2013;140(4):487-494.

[146]

Viale G, Regan MM, Maiorano E, et al. Prognostic and predictive value of centrally reviewed expression of estrogen and progesterone receptors in a randomized trial comparing letrozole and tamoxifen adjuvant therapy for postmenopausal early breast cancer: BIG 1-98. J Clin Oncol. 2007;25(25):3846-3852.

[147]

Thomsen C, Nielsen S, Nielsen BS, Pedersen SH, Vyberg M. Estrogen Receptor-α quantification in breast cancer: concordance between immunohistochemical assays and mRNA-In situ hybridization for ESR1 gene. Appl Immunohistochem Mol Morphol. 2020;28(5):347-353.

[148]

Bolton KL, Garcia-Closas M, Pfeiffer RM, et al. Assessment of automated image analysis of breast cancer tissue microarrays for epidemiologic studies. Cancer Epidemiol Biomarkers Prev. 2010;19(4):992-999.

[149]

Diaz LK, Sahin A, Sneige N. Interobserver agreement for estrogen receptor immunohistochemical analysis in breast cancer: a comparison of manual and computer-assisted scoring methods. Ann Diagn Pathol. 2004;8(1):23-27.

[150]

Faratian D, Kay C, Robson T, et al. Automated image analysis for high-throughput quantitative detection of ER and PR expression levels in large-scale clinical studies: the TEAM Trial Experience. Histopathology. 2009;55(5):587-593.

[151]

Rizzardi AE, Johnson AT, Vogel RI, et al. Quantitative comparison of immunohistochemical staining measured by digital image analysis versus pathologist visual scoring. Diagn Pathol. 2012;7:42.

[152]

Turbin DA, Leung S, Cheang MC, et al. Automated quantitative analysis of estrogen receptor expression in breast carcinoma does not differ from expert pathologist scoring: a tissue microarray study of 3,484 cases. Breast Cancer Res Treat. 2008;110(3):417-426.

[153]

Gokhale S, Rosen D, Sneige N, et al. Assessment of two automated imaging systems in evaluating estrogen receptor status in breast carcinoma. Appl Immunohistochem Mol Morphol. 2007;15(4):451-455.

[154]

Rexhepaj E, Brennan DJ, Holloway P, et al. Novel image analysis approach for quantifying expression of nuclear proteins assessed by immunohistochemistry: application to measurement of oestrogen and progesterone receptor levels in breast cancer. Breast Cancer Res. 2008;10(5):R89.

[155]

Ahern TP, Beck AH, Rosner BA, et al. Continuous measurement of breast tumour hormone receptor expression: a comparison of two computational pathology platforms. J Clin Pathol. 2017;70(5):428-434.

[156]

Shafi S, Kellough DA, Lujan G, Satturwar S, Parwani AV, Li Z. Integrating and validating automated digital imaging analysis of estrogen receptor immunohistochemistry in a fully digital workflow for clinical use. J Pathol Inform. 2022;13:100122.

[157]

Slamon DJ, Clark GM, Wong SG, Levin WJ, Ullrich A, McGuire WL. Human breast cancer: correlation of relapse and survival with amplification of the HER-2/neu oncogene. Science. 1987;235(4785):177-182.

[158]

Tandon AK, Clark GM, Chamness GC, Ullrich A, McGuire WL. HER-2/neu oncogene protein and prognosis in breast cancer. J Clin Oncol. 1989;7(8):1120-1128.

[159]

Press MF, Pike MC, Chazin VR, et al. Her-2/neu expression in node-negative breast cancer: direct tissue quantitation by computerized image analysis and association of overexpression with increased risk of recurrent disease. Cancer Res. 1993;53(20):4960-4970.

[160]

Park JW, Neve RM, Szollosi J, Benz CC. Unraveling the biologic and clinical complexities of HER2. Clin Breast Cancer. 2008;8(5):392-401.

[161]

Gancberg D, Järvinen T, di Leo A, et al. Evaluation of HER-2/neu protein expression in breast cancer by immunohistochemistry: an interlaboratory study assessing the reproducibility of HER-2/neu testing. Breast Cancer Res Treat. 2002;74(2):113-120.

[162]

Jacobs TW, Gown AM, Yaziji H, Barnes MJ, Schnitt SJ. Comparison of fluorescence in situ hybridization and immunohistochemistry for the evaluation of HER-2/neu in breast cancer. J Clin Oncol. 1999;17(7):1974-1982.

[163]

Wolff AC, Hammond MEH, Allison KH, et al. Human epidermal growth factor receptor 2 testing in breast cancer: American Society of Clinical Oncology/College of American Pathologists clinical practice guideline focused update. Arch Pathol Lab Med. 2018;142(11):1364-1382.

[164]

Brügmann A, Eld M, Lelkaitis G, et al. Digital image analysis of membrane connectivity is a robust measure of HER2 immunostains. Breast Cancer Res Treat. 2012;132(1):41-49.

[165]

Dobson L, Conway C, Hanley A, et al. Image analysis as an adjunct to manual HER-2 immunohistochemical review: a diagnostic tool to standardize interpretation. Histopathology. 2010;57(1):27-38.

[166]

Helin HO, Tuominen VJ, Ylinen O, Helin HJ, Isola J. Free digital image analysis software helps to resolve equivocal scores in HER2 immunohistochemistry. Virchows Arch. 2016;468(2):191-198.

[167]

Laurinaviciene A, Dasevicius D, Ostapenko V, Jarmalaite S, Lazutka J, Laurinavicius A. Membrane connectivity estimated by digital image analysis of HER2 immunohistochemistry is concordant with visual scoring and fluorescence in situ hybridization results: algorithm evaluation on breast cancer tissue microarrays. Diagn Pathol. 2011;6:87.

[168]

Skaland I, Øvestad I, Janssen EA, et al. Comparing subjective and digital image analysis HER2/neu expression scores with conventional and modified FISH scores in breast cancer. J Clin Pathol. 2008;61(1):68-71.

[169]

Holten-Rossing H, Møller Talman ML, Kristensson M, Vainer B. Optimizing HER2 assessment in breast cancer: application of automated image analysis. Breast Cancer Res Treat. 2015;152(2):367-375.

[170]

Koopman T, Buikema HJ, Hollema H, de Bock GH, van der Vegt B. What is the added value of digital image analysis of HER2 immunohistochemistry in breast cancer in clinical practice? A study with multiple platforms. Histopathology. 2019;74(6):917-924.

[171]

Koopman T, de Bock GH, Buikema HJ, et al. Digital image analysis of HER2 immunohistochemistry in gastric- and oesophageal adenocarcinoma: a validation study on biopsies and surgical specimens. Histopathology. 2018;72(2):191-200.

[172]

Hartage R, Li AC, Hammond S, Parwani AV. A validation study of human epidermal growth factor receptor 2 immunohistochemistry digital imaging analysis and its correlation with human epidermal growth factor receptor 2 fluorescence in situ hybridization results in breast carcinoma. J Pathol Inform. 2020;11:2.

[173]

Yue M, Zhang J, Wang X, et al. Can AI-assisted microscope facilitate breast HER2 interpretation? A multi-institutional ring study. Virchows Arch. 2021;479(3):443-449.

[174]

Zhang H, Moisini I, Ajabnoor RM, Turner BM, Hicks DG. Applying the new guidelines of HER2 testing in breast cancer. Curr Oncol Rep. 2020;22(5):51.

[175]

Modi S, Park H, Murthy RK, et al. Antitumor activity and safety of trastuzumab deruxtecan in patients with HER2-Low-Expressing advanced breast cancer: results from a phase Ib study. J Clin Oncol. 2020;38(17):1887-1896.

[176]

Modi S, Jacot W, Yamashita T, et al. Trastuzumab deruxtecan in previously treated HER2-low advanced breast cancer. N Engl J Med. 2022;387(1):9-20.

[177]

Yerushalmi R, Woods R, Ravdin PM, Hayes MM, Gelmon KA. Ki67 in breast cancer: prognostic and predictive potential. Lancet Oncol. 2010;11(2):174-183.

[178]

Pathmanathan N, Balleine RL, Jayasinghe UW, et al. The prognostic value of Ki67 in systemically untreated patients with node-negative breast cancer. J Clin Pathol. 2014;67(3):222-228.

[179]

Cheang MC, Chia SK, Voduc D, et al. Ki67 index, HER2 status, and prognosis of patients with luminal B breast cancer. J Natl Cancer Inst. 2009;101(10):736-750.

[180]

Petrelli F, Viale G, Cabiddu M, Barni S. Prognostic value of different cut-off levels of Ki-67 in breast cancer: a systematic review and meta-analysis of 64,196 patients. Breast Cancer Res Treat. 2015;153(3):477-491.

[181]

Pollack A, DeSilvio M, Khor LY, et al. Ki-67 staining is a strong predictor of distant metastasis and mortality for men with prostate cancer treated with radiotherapy plus androgen deprivation: Radiation Therapy Oncology Group trial 92-02. J Clin Oncol. 2004;22(11):2133-2140.

[182]

Dowsett M, Nielsen TO, A’Hern R, et al. Assessment of Ki67 in breast cancer: recommendations from the international Ki67 in breast cancer working group. J Natl Cancer Inst. 2011;103(22):1656-1664.

[183]

Christgen M, von Ahsen S, Christgen H, Länger F, Kreipe H. The region-of-interest size impacts on Ki67 quantification by computer-assisted image analysis in breast cancer. Hum Pathol. 2015;46(9):1341-1349.

[184]

Leung SCY, Nielsen TO, Zabaglo LA, et al. Analytical validation of a standardised scoring protocol for Ki67 immunohistochemistry on breast cancer excision whole sections: an international multicentre collaboration. Histopathology. 2019;75(2):225-235.

[185]

Shui R, Yu B, Bi R, Yang F, Yang W. An interobserver reproducibility analysis of Ki67 visual assessment in breast cancer. PLoS One. 2015;10(5):e0125131.

[186]

Stålhammar G, Fuentes Martinez N, Lippert M, et al. Digital image analysis outperforms manual biomarker assessment in breast cancer. Mod Pathol. 2016;29(4):318-329.

[187]

Grala B, Markiewicz T, Kozłowski W, Osowski S, Słodkowska J, Papierz W. New automated image analysis method for the assessment of Ki-67 labeling index in meningiomas. Folia Histochem Cytobiol. 2009;47(4):587-592.

[188]

Remes SM, Tuominen VJ, Helin H, Isola J, Arola J. Grading of neuroendocrine tumors with Ki-67 requires high-quality assessment practices. Am J Surg Pathol. 2012;36(9):1359-1363.

[189]

Ács B, Madaras L, Kovács KA, et al. Reproducibility and prognostic potential of Ki-67 proliferation index when comparing digital-image analysis with standard semi-quantitative evaluation in breast cancer. Pathol Oncol Res. 2018;24(1):115-127.

[190]

Stålhammar G, Robertson S, Wedlund L, et al. Digital image analysis of Ki67 in hot spots is superior to both manual Ki67 and mitotic counts in breast cancer. Histopathology. 2018;72(6):974-989.

[191]

Klauschen F, Wienert S, Schmitt WD, et al. Standardized Ki67 diagnostics using automated scoring—clinical validation in the gepartrio breast cancer study. Clin Cancer Res. 2015;21(16):3651-3657.

[192]

Koopman T, Buikema HJ, Hollema H, de Bock GH, van der Vegt B. Digital image analysis of Ki67 proliferation index in breast cancer using virtual dual staining on whole tissue sections: clinical validation and inter-platform agreement. Breast Cancer Res Treat. 2018;169(1):33-42.

[193]

Røge R, Riber-Hansen R, Nielsen S, Vyberg M. Proliferation assessment in breast carcinomas using digital image analysis based on virtual Ki67/cytokeratin double staining. Breast Cancer Res Treat. 2016;158(1):11-19.

[194]

Bankhead P, Loughrey MB, Fernández JA, et al. QuPath: open source software for digital pathology image analysis. Sci Rep. 2017;7(1):16878.

[195]

Humphries MP, Hynes S, Bingham V, et al. Automated tumour recognition and digital pathology scoring unravels new role for PD-L1 in predicting good outcome in ER–/HER2+ breast cancer. J Oncol. 2018;2018(1):1-14.

[196]

Das A, Nair MS, Peter DS. Batch mode active learning on the Riemannian manifold for automated scoring of nuclear pleomorphism in breast cancer. Artif Intell Med. 2020;103:101805.

[197]

Veta M, van Diest PJ, Willems SM, et al. Assessment of algorithms for mitosis detection in breast cancer histopathology images. Med Image Anal. 2015;20(1):237-248.

[198]

Mantrala S, Ginter PS, Mitkari A, et al. Concordance in breast cancer grading by artificial intelligence on whole slide images compares with a multi-institutional cohort of breast pathologists. Arch Pathol Lab Med. 2022;146(11):1369-1377.

[199]

Balkenhol MCA, Tellez D, Vreuls W, et al. Deep learning assisted mitotic counting for breast cancer. Lab Invest. 2019;99(11):1596-1606.

[200]

Romo-Bucheli D, Janowczyk A, Gilmore H, Romero E, Madabhushi A. Automated tubule nuclei quantification and correlation with Oncotype DX risk categories in ER+ breast cancer whole wlide images. Sci Rep. 2016;6:32706.

[201]

Nateghi R, Danyali H, Helfroush MS. A deep learning approach for mitosis detection: application in tumor proliferation prediction from whole slide images. Artif Intell Med. 2021;114:102048.

[202]

Ibrahim A, Jahanifar M, Wahab N, et al. Artificial intelligence-based mitosis scoring in breast cancer: clinical application. Mod Pathol. 2024;37(3):100416.

[203]

Azam AS, Tsang YW, Thirlwall J, et al. Digital pathology for reporting histopathology samples, including cancer screening samples—definitive evidence from a multisite study. Histopathology. 2024;84(5):847-862.

[204]

Li C, Wang X, Liu W, Latecki LJ. DeepMitosis: mitosis detection via deep detection, verification and segmentation networks. Med Image Anal. 2018;45:121-133.

[205]

Sebai M, Wang X, Wang T. MaskMitosis: a deep learning framework for fully supervised, weakly supervised, and unsupervised mitosis detection in histopathology images. Med Biol Eng Comput. 2020;58(7):1603-1623.

[206]

Mahmood T, Arsalan M, Owais M, Lee MB, Park KR. Artificial intelligence-based mitosis detection in breast cancer histopathology images using faster R-CNN and deep CNNs. J Clin Med. 2020;9(3):749.

[207]

Bulten W, Kartasalo K, Chen PC, et al. Artificial intelligence for diagnosis and gleason grading of prostate cancer: the PANDA challenge. Nat Med. 2022;28(1):154-163.

[208]

Bulten W, Pinckaers H, van Boven H, et al. Automated deep-learning system for gleason grading of prostate cancer using biopsies: a diagnostic study. Lancet Oncol. 2020;21(2):233-241.

[209]

Raciti P, Sue J, Retamero JA, et al. Clinical validation of artificial intelligence-augmented pathology diagnosis demonstrates significant gains in diagnostic accuracy in prostate cancer detection. Arch Pathol Lab Med. 2023;147(10):1178-1185.

[210]

Steiner DF, Nagpal K, Sayres R, et al. Evaluation of the use of combined artificial intelligence and pathologist assessment to review and grade prostate biopsies. JAMA Netw Open. 2020;3(11):e2023267.

[211]

Olsson H, Kartasalo K, Mulliqi N, et al. Estimating diagnostic uncertainty in artificial intelligence assisted pathology using conformal prediction. Nat Commun. 2022;13(1):7761.

[212]

Zheng Q, Mei H, Weng X, et al. Artificial intelligence-based multimodal prediction for nuclear grading status and prognosis of clear cell renal cell carcinoma: a multicenter cohort study. Int J Surg. 2025;111(6):3722-3730.

[213]

Tian K, Rubadue CA, Lin DI, et al. Automated clear cell renal carcinoma grade classification with prognostic significance. PLoS One. 2019;14(10):e0222641.

[214]

He QH, Tan H, Liao FT, et al. Stratification of malignant renal neoplasms from cystic renal lesions using deep learning and radiomics features based on a stacking ensemble CT machine learning algorithm. Front Oncol. 2022;12:1028577.

[215]

Xiong Y, Yao L, Lin J, et al. Artificial intelligence links CT images to pathologic features and survival outcomes of renal masses. Nat Commun. 2025;16(1):1425.

[216]

Maley CC, Koelble K, Natrajan R, Aktipis A, Yuan Y. An ecological measure of immune-cancer colocalization as a prognostic factor for breast cancer. Breast Cancer Res. 2015;17(1):131.

[217]

Heindl A, Sestak I, Naidoo K, Cuzick J, Dowsett M, Yuan Y. Relevance of spatial heterogeneity of immune infiltration for predicting risk of recurrence after endocrine therapy of ER+ breast cancer. J Natl Cancer Inst. 2018;110(2):166-175.

[218]

Loi S, Drubay D, Adams S, et al. Tumor-infiltrating lymphocytes and prognosis: a pooled individual patient analysis of early-stage triple-negative breast cancers. J Clin Oncol. 2019;37(7):559-569.

[219]

Hendry S, Salgado R, Gevaert T, et al. Assessing tumor-infiltrating lymphocytes in solid tumors: a practical review for pathologists and proposal for a standardized method from the international immunooncology biomarkers working group: part 1: assessing the host immune response, TILs in invasive breast carcinoma and ductal carcinoma in situ, metastatic tumor deposits and areas for further research. Adv Anat Pathol. 2017;24(5):235-251.

[220]

Le H, Gupta R, Hou L, et al. Utilizing automated breast cancer detection to identify spatial distributions of tumor-infiltrating lymphocytes in invasive breast cancer. Am J Pathol. 2020;190(7):1491-1504.

[221]

Su Z, Rezapour M, Sajjad U, Gurcan MN, Niazi MKK. Attention2 minority: a salient instance inference-based multiple instance learning for classifying small lesions in whole slide images. Comput Biol Med. 2023;167:107607.

[222]

Su Z, Afzaal U, Niu S, et al. Deep learning model for predicting lung adenocarcinoma recurrence from whole slide images. Cancers (Basel). 2024;16(17):3097.

[223]

Wang Y, Smith MR, Dixon CB, et al. IASLC grading system predicts distant metastases for resected lung adenocarcinoma. J Clin Pathol. 2025;78(6):409-415.

[224]

Su Z, Guo Y, Wesolowski R, et al. Computational pathology for accurate prediction of breast cancer recurrence: development and validation of a deep learning-based tool. Mod Pathol. 2025;38(12):100847.

[225]

Jaroensri R, Wulczyn E, Hegde N, et al. Deep learning models for histologic grading of breast cancer and association with disease prognosis. NPJ Breast Cancer. 2022;8(1):113.

[226]

Lu C, Romo-Bucheli D, Wang X, et al. Nuclear shape and orientation features from H&E images predict survival in early-stage estrogen receptor-positive breast cancers. Lab Invest. 2018;98(11):1438-1448.

[227]

Fernandez G, Prastawa M, Madduri AS, et al. Development and validation of an AI-enabled digital breast cancer assay to predict early-stage breast cancer recurrence within 6 years. Breast Cancer Res. 2022;24(1):93.

[228]

Albusayli R, Graham JD, Pathmanathan N, et al. Artificial intelligence-based digital scores of stromal tumour-infiltrating lymphocytes and tumour-associated stroma predict disease-specific survival in triple-negative breast cancer. J Pathol. 2023;260(1):32-42.

[229]

Ivanova M, Pescia C, Trapani D, et al. Early breast cancer risk assessment: integrating histopathology with artificial intelligence. Cancers (Basel). 2024;16(11):1981.

[230]

Ahn JS, Shin S, Yang SA, et al. Artificial intelligence in breast cancer diagnosis and personalized medicine. J Breast Cancer. 2023;26(5):405-435.

[231]

McCaffrey C, Jahangir C, Murphy C, Burke C, Gallagher WM, Rahman A. Artificial intelligence in digital histopathology for predicting patient prognosis and treatment efficacy in breast cancer. Expert Rev Mol Diagn. 2024;24(5):363-377.

[232]

Chan RC, To CKC, Cheng KCT, Yoshikazu T, Yan LLA, Tse GM. Artificial intelligence in breast cancer histopathology. Histopathology. 2023;82(1):198-210.

[233]

[234]

Klein ME, Dabbs DJ, Shuai Y, et al. Prediction of the Oncotype DX recurrence score: use of pathology-generated equations derived by linear regression analysis. Mod Pathol. 2013;26(5):658-664.

[235]

Romo-Bucheli D, Janowczyk A, Gilmore H, Romero E, Madabhushi A. A deep learning based strategy for identifying and associating mitotic activity with gene expression derived risk categories in estrogen receptor positive breast cancers. Cytometry A. 2017;91(6):566-573.

[236]

Liu Y, Han D, Parwani AV, Li Z. Applications of artificial intelligence in breast pathology. Arch Pathol Lab Med. 2023;147(9):1003-1013.

[237]

Soliman A, Li Z, Parwani AV. Artificial intelligence’s impact on breast cancer pathology: a literature review. Diagn Pathol. 2024;19(1):38.

[238]

Xu Z, Zhou Z, Son JB, et al. Deep learning models based on pretreatment MRI and clinicopathological data to predict responses to neoadjuvant systemic therapy in triple-negative breast cancer. Cancers (Basel). 2025;17(6):966.

[239]

Jimenez JE, Abdelhafez A, Mittendorf EA, et al. A model combining pretreatment MRI radiomic features and tumor-infiltrating lymphocytes to predict response to neoadjuvant systemic therapy in triple-negative breast cancer. Eur J Radiol. 2022;149:110220.

[240]

Krishnamurthy S, Jain P, Tripathy D, et al. Predicting response of triple-negative breast cancer to neoadjuvant chemotherapy using a deep convolutional neural network-based artificial intelligence tool. JCO Clin Cancer Inform. 2023;7:e2200181.

[241]

Huang Z, Shao W, Han Z, et al. Artificial intelligence reveals features associated with breast cancer neoadjuvant chemotherapy responses from multi-stain histopathologic images. NPJ Precis Oncol. 2023;7(1):14.

[242]

Gianfrancesco MA, Tamang S, Yazdany J, Schmajuk G. Potential biases in machine learning algorithms using electronic health record data. JAMA Intern Med. 2018;178(11):1544-1547.

[243]

Parikh RB, Teeple S, Navathe AS. Addressing bias in artificial intelligence in health care. JAMA. 2019;322(24):2377-2378.

[244]

Rajkomar A, Hardt M, Howell MD, Corrado G, Chin MH. Ensuring fairness in machine learning to advance health equity. Ann Intern Med. 2018;169(12):866-872.

[245]

Anderson B. How to bridge innovation and regulation for responsible AI in healthcare. Nat Med. 2024;30(5):1231.

[246]

Gilbert S, Mathias R, Schönfelder A, et al. A roadmap for safe, regulation-compliant living labs for AI and digital health development. Sci Adv. 2025;11(20):eadv7719.

[247]

Rahimzadeh V. US regulation of medical artificial intelligence and machine learning (AI/ML) research and development. In: Solaiman B, Cohen IG, eds. Research Handbook on Health, AI and the Law. Cheltenham, UK: Edward Elgar Publishing Ltd; 2024. Chapter 16.

[248]

Richards B, Sage Jacobson S, James Aquino YS. Regulation of AI in health care: a cautionary tale considering horses and zebras. J Law Med. 2021;28(3):645-654.

[249]

Lu MY, Chen B, Williamson DF, et al. A foundational multimodal vision language AI assistant for human pathology. arXiv. Preprint posted online 2023. arXiv: 2312.07814.

[250]

Vorontsov E, Bozkurt A, Casson A, et al. A foundation model for clinical-grade computational pathology and rare cancers detection. Nat Med. 2024;30(10):2924-2935.

[251]

Lu MY, Chen B, Williamson DFK, et al. A visual-language foundation model for computational pathology. Nat Med. 2024;30(3):863-874.

[252]

Huang Z, Bianchi F, Yuksekgonul M, Montine TJ, Zou J. A visual-language foundation model for pathology image analysis using medical twitter. Nat Med. 2023;29(9):2307-2316.

[253]

Fei N, Lu Z, Gao Y, et al. Towards artificial general intelligence via a multimodal foundation model. Nat Commun. 2022;13(1):3094.

[254]

Waqas A, Bui MM, Glassy EF, et al. Revolutionizing digital pathology with the power of generative artificial intelligence and foundation models. Lab Invest. 2023;103(11):100255.

[255]

Truhn D, Eckardt JN, Ferber D, Kather JN. Large language models and multimodal foundation models for precision oncology. NPJ Precis Oncol. 2024;8(1):72.

[256]

Xu Y, Wang Y, Zhou F, et al. A multimodal knowledge-enhanced whole-slide pathology foundation model. Nat Commun. 2025;16(1):11406.

[257]

Zou J, Topol EJ. The rise of agentic AI teammates in medicine. Lancet. 2025;405(10477):457.

[258]

Karunanayake N. Next-generation agentic AI for transforming healthcare. Informatics and Health. 2025;2(2):73-83.

[259]

Huang K. AI Agents in Healthcare. In: Agentic AI: Theories and Practices. Springer; 2025:303-321.

[260]

Sapkota R, Roumeliotis KI, Karkee M. AI agents vs. agentic AI: a conceptual taxonomy, applications and challenges. arXiv. Preprint posted online 2025. arXiv: 2505.10468.

[261]

Ferber D, El Nahhas OSM, Wölflein G, et al. Development and validation of an autonomous artificial intelligence agent for clinical decision-making in oncology. Nat Cancer. 2025;6(8):1337-1349.

[262]

Chen C, Weishaupt LL, Williamson DF, et al. Evidence-based diagnostic reasoning with multi-agent copilot for human pathology. arXiv. Preprint posted online 2025. arXiv: 2506.20964.

[263]

Li S, Xu J, Bao T, et al. A co-evolving agentic AI system for medical imaging analysis. arXiv. Preprint posted online 2025. arXiv: 2509.20279.

[264]

Shaktah LA, Carrero ZI, Hewitt KJ, et al. Application of artificial intelligence and digital tools in cancer pathology. Lancet Digit Health. 2025;7(10):100933.

RIGHTS & PERMISSIONS

The Author(s) 2026. This article is published by Higher Education Press at journal.hep.com.cn.