- Review
- Open access
- Published:
The association between tumor-stromal collagen features and the clinical outcomes of patients with breast cancer: a systematic review
Breast Cancer Research volume 27, Article number: 69 (2025)
Abstract
Background
The tumor microenvironment (TME), particularly the extracellular matrix (ECM), plays a crucial role in regulating breast cancer progression. Among ECM components, collagen type I—accounting for over 90% of fibrillar collagen in the human body—is the primary structural component of the tumor ECM. It critically modulates tumor cell behavior, influencing migration, invasion, and therapy resistance. The structural organization of collagen type I fibers can significantly impact clinical outcomes.
Methods
This systematic review aimed to assess the association between tumor-stromal collagen type I characteristics and clinical outcomes in breast cancer. A comprehensive search strategy identified studies from major databases, which were appraised using quality assessment tools. Data on collagen quantity, morphology, alignment, and organization were extracted and analyzed to explore their relationship with survival, metastasis, therapy resistance, and clinical characteristics of breast cancer.
Results
Our analysis revealed that increased collagen density—particularly with an organized/aligned fiber orientation—was strongly associated with poor prognosis. Specifically, increased intratumoral collagen quantity was linked to reduced overall survival (HR = 7.84, p = 0.031). Stage III tumors exhibiting elevated collagen uniformity showed higher metastasis rates (p = 0.004), and HER2⁺ tumors with high collagen content correlated with resistance to HER2-targeted therapies (p < 0.05). Furthermore, higher collagen curviness was associated with better outcomes, including a reduced recurrence risk (HR = 0.77, p < 0.001). Subtype-specific trends emerged as ER/PR-negative tumors more frequently exhibited a perpendicular collagen arrangement (p = 0.02), whereas ER/PR-positive tumors showed elevated COL1A1 expression (p < 0.0001). Despite these patterns, the heterogeneity of study methodologies and the complexity of the tumor microenvironment highlight the need for unified frameworks to advance clinical translation.
Conclusions
This review highlights the prognostic significance of tumor-stromal collagen characteristics in breast cancer, suggesting that future research should focus on the molecular mechanisms underlying collagen remodeling and its potential as a cancer biomarker and therapeutic target.
Introduction
Breast cancer, a significant global health challenge, affects 2.1 million women every year and results in the death of 30% of those diagnosed [1]. Increasing evidence suggests that the tumor microenvironment (TME) plays a fundamental role in tumor progression across various cancer types, including breast cancer [2]. Identifying the specific contributions of each TME component could lead to more effective cancer management strategies [3]. One hallmark feature of the TME in desmoplastic cancers, such as breast cancer, is the extracellular matrix (ECM) [2]. The ECM in breast tissue is a dynamic network of structural proteins, including collagens, elastin, periostin, laminins, fibronectin, osteonectin, thrombospondin, osteopontin, entactin, tenascin, and glycosaminoglycans, and among these, collagens are the most abundant protein in ECM [4], and play a crucial role in cancer cell survival and metastasis [5].
More than 28 types of collagens have been identified in humans and are categorized by structure and function. Fibrillar collagens, such as types I, II, and III, provide tensile strength, while collagens associated with fibrils, like types IX and XII, regulate fibril assembly. Network-forming collagens, such as type IV, constitute basement membranes, and other collagens, including types XIII and XVII, anchor cells to the ECM [4]. Collagen type I, the focus of this review, constitutes 90% of the fibrillar collagen in breast cancer stroma, making it the dominant component observed and a key determinant of matrix biophysical properties [6].
Collagen type I, referred to as collagen in this study, can affect tumor cells behavior via its biochemical properties [7]. Furthermore, the ability of cancer cells to sense and respond to mechanical signals has attracted increasing attention in recent years [8]. During cancer initiation and progression, there is a continuous reciprocal interplay between cancer cells and collagen, both within and around the tumor. This dynamic interplay results in changes in various collagen fiber parameters [9]. A variety of techniques are used to evaluate collagen parameters in breast cancer.
Methodologies for investigating collagen fiber features in breast cancer research
In addition to traditional methods such as real-time PCR (which measures mRNA expression of collagen type I, alpha 1, COL1A1) and immunohistochemistry (IHC, which detects COL1A1 protein) [10, 11], collagen structural features (such as quantity and organization) can also be assessed through different staining and imaging techniques, as well as computational analyses.
Imaging techniques
Second Harmonic Generation (SHG) microscopy
SHG microscopy is a powerful label-free optical technique widely employed for the specific imaging of collagen fibers. This method harnesses the unique structure of fibrillar collagens to generate a high-contrast signal without the need for exogenous stains or labels. The fundamental principle of SHG involves the interaction of two photons with the same frequency from an incident near-infrared laser beam with the non-centrosymmetric arrangement of collagen molecules, resulting in the emission of a single photon with exactly double the frequency (half the wavelength) of the excitation light. This intrinsic property allows for highly specific visualization of collagen [12, 13].
SHG microscopy offers several key advantages for the study of collagen in biological systems. Its ability to resolve submicron features enables the detailed visualization of collagen fiber architecture and organization [14]. The technique is versatile and can be applied to fresh, fixed, or formalin-fixed paraffin-embedded (FFPE) tissue samples, making it adaptable to various research and potential clinical applications [13]. Moreover, Polarimetric SHG (P-SHG) enhances quantitative analysis by resolving collagen fiber orientation and molecular alignment, critical for studying structural changes in pathologies like breast cancer [13, 15].
Polarized Light Microscopy (PLM)
PLM is a technique that capitalizes on the intrinsic birefringence of collagen fibers to visualize their organization within tissues. Birefringence is an optical property exhibited by anisotropic materials where the refractive index of the material varies depending on the polarization and propagation direction of light passing through it. Collagen fibers, due to their highly ordered molecular arrangement, possess this characteristic. PLM, when combined with Sirius Red staining, offers a cost-effective 2D visualization of collagen birefringence, suitable for broad screening. However, unlike SHG (which provides 3D label-free imaging), PLM lacks depth resolution and molecular specificity. However, this accessibility makes PLM a valuable tool, particularly in resource-limited settings or for initial broad screening of tissue samples [16].
Quantitative Phase Imaging (QPI)
QPI maps the optical path length difference (OPD) in tissue samples, providing insights into their nanoarchitecture without the need for stains. It works by measuring light phase shifts using interferometry, offering label-free imaging that avoids staining-related artifacts. Color Spatial Light Interference Microscopy (cSLIM) is an enhancement of QPI that allows the analysis of stained tissues. cSLIM integrates with stained samples by computationally separating stain-induced phase shifts from intrinsic tissue properties, enabling simultaneous traditional and label-free analysis. Therefore, it integrates seamlessly with current pathology workflows, enabling both traditional stain-based analysis and advanced stain-independent assessments via mathematical corrections. While QPI (and cSLIM) detects refractive index variations in dense ECM structures (including collagen), it lacks SHG’s specificity. Collagen is inferred based on structural morphology. With compatibility for stained tissues and high throughput potential, cSLIM shows great promise for clinical and pathology applications, bridging traditional and innovative methodologies effectively [17, 18].
Staining techniques
Hematoxylin and Eosin (H&E) Staining
H&E staining enables the visual identification of collagen-rich stroma in tissue samples due to their characteristic eosinophilic (pink/red) staining. This routine and cost-effective method allows for an initial assessment of the collagenous stroma. Crucially, H&E-stained slides can undergo computational analysis to yield quantitative data on collagen. While techniques like SHG offer greater specificity, the widespread availability of H&E makes computational approaches valuable for leveraging existing resources in collagen studies [17, 19].
Masson’s trichrome staining
Masson’s trichrome staining is a histological technique utilized to assess the composition of collagen by specifically highlighting collagen fibers in blue, allowing for its clear visual identification within the tissue. This method is crucial as it enables the measurement of collagen area percentage in tissue sections through subsequent image analysis. While providing valuable quantitative data on bulk collagen content for studying various pathological conditions like breast cancer, it primarily reveals the overall amount rather than detailed fibrillar-level structural organization, for which techniques like SHG are more suited [20, 21].
Sirius red and picrosirius red staining
Sirius red staining and the closely related picrosirius red staining are collagen-specific histochemical techniques that function by intensely binding to collagen fibers and enhancing their natural birefringence when viewed under polarized light, making them prominent methods for collagen structural analysis. They can be used to visualize collagen and are frequently combined with polarization microscopy to offer detailed insights into the organization of the collagen network within tissue samples. Picrosirius red staining, a variant of Sirius red, is prepared by mixing Sirius Red with a saturated aqueous solution of picric acid. When viewed under light microscopy, picrosirius red stains collagen fibers red, while epithelial cells appear yellow. This staining method under polarized light intensifies the birefringence of collagen, allowing for a more detailed assessment of its structure [22, 23].
Computational analysis
Following the acquisition of images through various staining and microscopy techniques, a range of computational methods are essential for extracting quantitative information about collagen fiber features. Automated tools like CT-FIRE (for fiber extraction and morphometry) and CurveAlign (for spatial alignment analysis) are critical for processing SHG microscopy images, enabling precise quantification of fiber length, width, alignment, and orientation relative to tumor-stroma interfaces [24]. Beyond these automated approaches, more sophisticated analytical techniques like curvelet transform are also employed. The curvelet transform is a sophisticated mathematical tool that is particularly well-adapted for analyzing images containing curve-like features, such as the fibrillar structure of collagen captured in SHG microscopy images [25]. By applying the curvelet transform to SHG images, researchers can then derive several important quantitative features of individual collagen fibers such as orientation, length, and curvature, providing a detailed characterization of the collagen network [26].
Other computational approaches provide valuable insights into collagen organization. Texture analysis, which includes both first-order (evaluating the properties of individual pixels within an image) and second-order (assessing the relationships between pairs or groups of pixels) statistical techniques, enables the extraction of features indicative of structural changes in the collagen network. A key tool within this domain is the Gray Level Co-occurrence Matrix (GLCM), a statistical approach that examines the spatial relationships between pixel pairs to analyze the texture of collagen fibers. Parameters derived from GLCM, such as correlation and energy, provide significant distinctions in collagen fiber patterns. GLCM quantifies collagen texture via spatial pixel relationships. For example, GLCM energy reflects uniformity, while correlation indicates linear patterns. Coherence, a measure of fiber alignment, is calculated separately using structure tensor analysis [25].
Another valuable computational approach for quantitatively evaluating the overall orientation of collagen fibers within SHG microscopy images is the Fast Fourier Transform (FFT) analysis. SHG image of collagen fibers is composed of various directional elements, FFT decomposes SHG images into spatial frequencies. Dominant fiber alignment produces a frequency spectrum with intensity peaks perpendicular to the fiber direction. An ellipse fitted to the spectrum quantifies alignment: a high aspect ratio (elongated ellipse) indicates strong alignment, while a circular shape reflects randomness. The aspect ratio defines the collagen orientation index [27, 28]. To derive a single, quantifiable measure of this overall orientation, an ellipse is often fitted to the distribution of frequencies in the spectrum. The shape of this fitted ellipse directly reflects the degree of fiber alignment: a highly elongated ellipse indicates a strong preference for a particular orientation, whereas a more circular ellipse suggests a lack of dominant orientation. From the parameters of this ellipse, a collagen orientation index is calculated, providing a numerical value that characterizes the overall alignment of collagen fibers within the image. This allows for an objective and comparative assessment of collagen architecture in different biological contexts [29].
The integration of computational power extends to more advanced analytical paradigms. Machine learning and supervised learning algorithms are increasingly used to quantify and score the alignment of collagen fibers concerning adjacent tumor-stromal boundaries, to predict patient survival outcomes [30]. It is worth noting that despite the advancements in automated and complex computational methods, manual quantification of collagen fiber angles using tools like ImageJ remains a relevant approach in certain research studies [31, 32].
In addition to the diverse techniques employed, cancer-related modifications of collagen have been assigned various terms in different studies. In this review, the authors classify these terms into five distinct categories. The first category is alteration of the quantity or density of collagen in the TME, which is referred to by various labels, including the area of collagen I, density, expression, deposition, content, and intensity. Collagen quantity is influenced by both biosynthesis and biodegradation. Moreover, collagen density is influenced by covalent intermolecular cross-links within collagen fibers [4]. The second category of collagen that could be affected during cancer initiation and progression is indicated by fiber morphology, encompassing fiber width, length, and straightness/curviness [24]. The third feature is the intrinsic microstructural properties of individual collagen fibers, including fibril diameter, spacing, and order/disorder packing, which can be determined by the ratio of forward to backward (F/B) propagating SHG signals [12]. When a laser beam used for SHG microscopy interacts with the collagen fibers of the studied tissue sample, it generates SHG signals that can be emitted either in the same direction as the incident light (forward) or in the opposite direction (backward). The F/B SHG ratio quantifies the relative intensity of these forward and backward SHG signals and reflects the diameter or packing arrangement of the collagen fibrils within the tissue [12]. Fibrils with thinner diameters or sparse packing structures may exhibit weaker forward SHG signals, resulting in a lower F/B ratio, which could be related to abnormal, dysregulated, and rapid synthesis of collagen fibers (Fig. 1) [33]. Notably, the F/B SHG ratio reflects the aforementioned fibril properties, not the diameter or arrangement of the fibers [12].
Schematic representation of the relationship between collagen structure and the forward-to-backward (F/B) Second Harmonic Generation (SHG) ratio. This figure illustrates how the F/B SHG ratio, obtained through SHG microscopy, reflects the microstructural properties of collagen fibers. On the left, a high F/B SHG ratio, indicative of normal tissue, is associated with well-organized collagen fibrils forming thick, intact fibers. In contrast, on the right, a low F/B SHG ratio, characteristic of pathologic tissue, suggests smaller fibril diameters, decreased order in interfibrillar packing, and increased fiber segmentation. The center panel depicts the SHG microscopy process, where a laser beam interacts with collagen, generating forward and backward SHG signals. The ratio of these signals provides insights into the collagen’s microstructural organization, which is altered in disease states
However, the arrangement of collagen fibers can change during cancer progression, and terms such as organization, orientation, alignment, directionality variation, anisotropy, and dispersion can be attributed to this variable. Therefore, the fourth category is dedicated to collagen fiber organization. Additionally, during breast cancer progression, the angle of stromal collagen fibers to the tumor boundary tends to become perpendicular, representing the fifth category [34]. Moreover, several studies have attempted to design prognostic indices based on various collagen features [25, 26, 35,36,37]. For example, researchers have defined different patterns of TME collagen I, known as tumor-associated collagen signatures (TACSs), during tumor evolution. For example, TACS-1 stands for dense collagen regions wrapped around emergent tumor foci, TACS-2 represents straightened collagen fibers stretched around a tumor, suggesting growth and increased volume, and TACS-3 is related to radially aligned collagen fibers associated with tumor cell invasion, potentially indicating invasive and metastatic growth (Fig. 2) [38].
Schematic depiction of Tumor-Associated Collagen Signatures (TACSs) during breast cancer progression. This figure illustrates the evolving patterns of collagen organization, known as TACSs, in breast cancer’s tumor microenvironment. The “Native collagen structure” panel shows the baseline arrangement of collagen fibers. TACS-1 depicts dense collagen surrounding emerging tumor, TACS-2 illustrates straightened collagen fibers stretched around the tumor, suggesting tumor growth and increased volume. Finally, TACS-3 shows radially aligned collagen fibers associated with tumor cell invasion, potentially indicating invasive growth
Another study attempted to identify and introduce more TACSs at the invasive front of breast tumors. TACS-4 involves collagen fibers forming a network near expanding tumors, creating a distinct tumor boundary. TACS-5 pertains to collagen fibers that align directionally, facilitating one-way tumor cell migration without a clear boundary. TACS-6 indicates collagen fibers align chaotically, enabling multidirectional tumor cell migration without a clear boundary. TACS-7 refers to densely packed collagen fibers at the tumor invasion front that are mostly free of tumor cells, whereas TACS-8 refers to sparsely packed collagen fibers at the tumor invasion front that are mostly free of tumor cells, given that a single tumor may contain a variety of TACSs, a prognostic index has been formulated to reflect the composite TACS profile for an individual patient [35]. Some of the important collagen features analyzed in breast cancer research were summarized in Table 1.
Several studies have explored the associations of various indices or features of collagen with different aspects of breast cancer, such as hormone receptor (PR, ER, HER2) expression, tumor stage, grade, and the likelihood of recurrence, metastasis, therapy response, and survival. Some studies have attempted to develop a prognostic index based on various TME collagen features. Our objective was to conduct a systematic review that thoroughly reviewed these studies and uncovered intricate relationships between structural collagen features and breast cancer characteristics. Our analysis offers comprehensive insight into this complex association.
Materials and methods
Eligibility criteria
The eligibility criteria and research question have been established via the PICOs framework: “P” as population, “I” as intervention, “C” as comparator/control, “O” as outcome, and “S” as study type [39]. The following inclusion criteria were based on the PICO structures:
-
1)
Observational studies (cross-sectional, case-control, and cohort studies) published in English after 1990 with available full-text and conference abstracts that evaluated the tumor-stromal collagen features of human breast cancer were included.
-
2)
Studies with human breast cancer.
-
3)
Studies have assessed the associations between TME collagen features of primary tumors in human breast cancer and the prognostic value of overall survival (OS), disease-free survival (DFS), relapse-free survival (RFS), and disease-specific survival (DSS)/cancer-specific survival (CSS).
-
4)
Studies have assessed the associations between TME collagen features of primary tumors in human breast cancer and distant metastasis, recurrence, therapy resistance, tumor stage, tumor grade, and hormonal receptors of breast cancer cells (ER, PR, and HER2).
-
5)
TME Collagen features include the following:
-
Collagen quantity: area, density, expression, deposition, content, and intensity of collagen fibers.
-
Collagen morphology: collagen fiber width, length, and straight/curviness.
-
Collagen organization: orientation, alignment, directionality variation, dispersion, and uniformity of collagen fibers.
-
Angle between the fiber direction and the tumor boundary
-
The microstructural properties of collagen are determined by the forward-SHG to backward-SHG ratio.
-
Prognostic indices were developed on the basis of collagen features.
-
Investigations that met the following criteria were excluded:
-
1)
Reviews, commentaries, case reports, and case series.
-
2)
In vitro and in vivo studies
-
3)
Studies not related to the topic of interest
-
4)
Studies for which the full text was either unavailable or not in English.
Information sources
The search was conducted in four main electronic databases: PubMed/MEDLINE, Web of Science (WOS), Scopus, and Embase via www.Embase.com until June 1, 2023, and was updated on September 1, 2024. Furthermore, the search was extended to gray literature, including conference papers. We also reviewed the references within the selected studies to perform a comprehensive search and prevent any data from being overlooked.
Search strategy
The current study was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines [40]. The primary search terms employed in the search strategy were neoplasm, breast cancer, and collagen I. The search syntax was altered in other databases (Table S1). They were limited to English, which was published after 1990.
Selection process
Three steps were performed to include a paper in the study. Initially, duplicates were eliminated using EndNote software (version X9.3.3; Thomson Reuters, Philadelphia, U.S.A.) and manual searching. Two independent authors (S.H. and F.T.) assessed the titles and abstracts of the remaining studies. Ultimately, two authors (S.H. and S.S.) separately evaluated the studies based on their full text. Any disagreement between the two authors was resolved through consensus and subsequently validated by a third author (R.G.).
Data collection process
Data from each study were independently extracted by two researchers (S.H. and S.S.) and entered into a designated “Data Extraction Form.” Discrepancies between the two authors were resolved through consensus and subsequently reviewed by a third author (F.T.).
Data items
Data extraction for each included study was conducted according to the following criteria: author, year of publication, sample size, collagen features (defined by the authors of the included studies), collagen feature category (defined by the authors of the present systematic review, including quantity, morphology, microstructural properties, organization, angle related to the tumor boundary, and invented prognostic indices), and association of collagen features with survival, metastasis, grade, stage, therapy response, recurrence, and hormonal receptor (PR, ER, HER2) status.
Quality assessment
Study quality was assessed by two researchers (S.H. and F.T.) using the Newcastle-Ottawa Scale [41] tool. The NOS tool is structured around three domains—selection, comparability, and exposure or outcome—and assigns a total possible score ranging from to 0–9 [42]. Based on this evaluation, the studies were categorized into three levels of quality: good, fair, and poor. In cases of disagreement between the two authors, a third author (Z.M.) was consulted to reach consensus.
Protocol and registration
This systematic review has been registered in the Open Science Framework (OSF), with the URL: OSF | Association of tumoral collagen topology with breast cancer patients’ clinical outcomes: A systematic review.
Result
Study selection
A comprehensive search of four major databases (PubMed/MEDLINE, Web of Science, Scopus, and Embase) from inception to September 1, 2024, yielded 21,531 references. After removing duplicates and excluding review articles, 13,081 studies remained for title and abstract screenings. Of these, 12,081 were excluded based on the eligibility criteria. Full-text assessment of the remaining 100 studies led to the exclusion of 56 studies, primarily due to insufficient data or non-compliance with the inclusion criteria. Finally, 44 studies were included in the qualitative analysis. A detailed flowchart illustrating the search process is shown in Fig. 3.
Study characteristics
All the studies (n = 44) were performed on female human breast tumor samples. Geographically, most clinical studies were conducted in the USA (n = 16) and China (n = 16). The remaining 12 studies were conducted in Singapore (2), Greece (2), Ukraine (2), Norway (1), Brazil (1), United Kingdom (1), Russia (1), Lithuania (1), and Sweden (1). Data were extracted from the included articles to evaluate and integrate the results (Tables S1 and S2 in Supplementary File 1 and Tables in Supplementary File 2). Noticeably, significant variation has been observed among studies on the type of breast cancer samples, methodology for studying collagen and defining various collagen features. To analyze the association between collagen features and breast cancer outcomes, information about methodology, collagen features, and their categories was designed by the authors of the present paper, evaluated the clinical features of patients, and extracted key findings.
Quality assessment
Of the 40 peer-reviewed studies, the majority (n = 37) were assessed as high-quality according to the NOS quality assessment tool, while the rest (n = 3) were considered fair. Fig. 4 shows the quality of the included studies.
Clinical outcomes
Survival prediction
To describe the prognostic significance of collagen features, data were subdivided into three dedicated sections: (1) collagen quantity, (2) collagen architecture, and (3) collagen-based prognostic indexes. Key findings are summarized in Table 2.
Collagen quantity and survival outcomes
The relationship between the quantity of collagen fibers and patient outcomes was studied in multiple studies. These studies suggest that this molecule can have diverse effects depending on molecular subtype of breast cancer and the location of collagen in the TME, such as the intratumor or tumor-nest boundary. In luminal breast cancer, elevated intratumoral collagen quantity correlated with worse overall survival (OS), demonstrated by a hazard ratio (HR) of 7.84 in univariate analysis and significance in multivariate analysis. However, this effect was not consistent across all breast cancer subtypes [43]. Two other studies have reported that the total amount of collagen deposition did not correlate with survival [44, 45]. Two studies revealed that a dense layer of collagen fibers surrounding tumor nests was linked to better OS (HR = 0.96) and progression-free survival (PFS, HR = 0.95) [44] and improved disease-free survival (DFS) and OS in triple-negative breast cancer (TNBC) patients [46]. However, two other studies revealed that increased overall collagen density is associated with worse DFS (HR = 2.107) [47] and disease-specific survival (DSS) with no significant correlation [23] in breast cancer patients. Another study revealed that increased intratumoral collagen uniformity (density) did not significantly affect OS [43].
Collagen architecture and survival outcomes
Multiple studies have investigated the organization of tumor collagen and its association with patient survival using SHG. One study found that several collagen framework features, including wider collagen fibers (HR=14.25), high variability in collagen fiber width (HR=5.01), a more disorganized arrangement of collagen fibers (HR=4.07), and high variability in collagen fiber arrangement across different regions of the tissue (HR=4.54), were associated with worse overall survival in invasive breast cancer patients, even after adjusting for other factors. On the other hand, median fiber straightness (HR=0.12) was associated with better overall survival [25]. Another study demonstrated that more uniform fibers alignment was associated with worse DFS (HR = 1.96, p = 0.011) and that it trended toward reduced DSS (HR = 1.59, p = 0.10) [23]. Two studies failed to establish the prognostic value of collagen orientation for OS [27, 43]. The discrepancy between these investigations could be attributed to the angle of the aligned collagen fibers. To clarify, when the studied fibers are oriented parallel to the tumor periphery, more anisotropic structured fibers could be associated with better OS (HR = 0.06, p = 0.007) [44]. However, perpendicular angle relative to the tumor boundary with more well-aligned collagen fibers (defined as TACS-3) were significantly associated with worse DFS (HR = 2.59, p = 0.002) and DSS (HR = 2.25, p = 0.008) compared to less-aligned fibers [23]. Two additional studies reinforced this trend, linking TACS-3 to poorer DFS (HR = 2.39, p = 0.017; HR = 3.04, p = 0.02) and DSS (HR = 2.17, p = 0.037; HR = 3.34, p = 0.017) [17, 48]. However, one study found no significant correlation between TACS-3 or combined TACS-2 (aligned collagen constraining tumors) and TACS-3 and OS or progression-free survival (PFS) [44].
Moreover, no significant correlation was discovered between DFS or DSS and fiber curvature [23]. Furthermore, multivariate Cox analysis revealed that better overall survival was marginally related to a lower F/B ratio after adjusting for traditional prognostic factors, although this relationship was not statistically significant (p = 0.068) [27].
Prognostic value of collagen-based indexes
Recent advances in developing prognostic biomarkers in breast cancer leverage TME collagen architecture. Xi et al. [35] developed a model using eight (TACS-1–8), where collagen alignment at the tumor’s invasive front predicted outcomes. For example, TACS-1 and TACS-4 (coefficients = 1.575 and 1.101) correlated with better survival, while others, particularly TACS-3 and TACS-6 (coefficients = 1.905 and 1.535) signaled higher risk. When combined with clinical factors, this model achieved strong accuracy for 5-year DFS (AUC: 0.854) and stratified low- and high-risk patients (HR = 5.548) [35]. Complementing this, Xi et al. [26] analyzed TACS corresponding microscopic features (TCMFs), including fiber texture and density, refining predictions. Combining TCMFs with TACS and clinical factors boosted accuracy (AUC: 0.882) and stratified low- and high-risk patients (HR = 6.9) [26]. To capture hidden prognostic clues, Qiu et al. modeled spatial collagen interactions using an intratumor graph neural network (IGNN), which outperformed traditional clinical factors (HR = 8.77) by integrating regional biomarker relationships [36]. At tumor boundaries, Huang et al. identified collagen signatures at tumor boundary (CSTB) that surpassed clinical models in risk stratification (AUC: 0.753) and enhanced nomogram accuracy [49]. Furthermore, four other studies successfully developed unique models centered on collagen features to predict survival [23, 25, 37, 46]. These studies highlight collagen’s role as a prognostic marker, offering tools to personalize treatment by identifying undertreated high-risk patients.
Recurrence
According to multiple studies, many aspects of collagen fibers are associated with the risk of recurrence (Table 3). One study showed that DCIS cases with wider collagen fiber width and higher collagen numbers (specifically within the box with a specific sizes of 48x48 µm and 96x96 µm) had a better recurrence-free survival. Conversely, DCIS cases with higher fiber straightness and greater distance to the nearest two fibers showed worse recurrence-free survival. Researchers have also introduced five composite factors, three of which are inversely associated with recurrence. These include Factor 2 (high density, long distance to boundary), which is associated with better recurrence-free survival. Factor 3 (serpentine, not straight) is also inversely associated with recurrence. Finally, Factor 5 (long, wide fibers) is also associated with better recurrence-free survival. Factor 1 (high density, low alignment) and Factor 4 (high alignment, parallel to boundary) showed no significant association with recurrence [24]. One study demonstrated that in univariate analysis, elevated intratumoral collagen quantity was associated with a fourfold increased risk of recurrence (HR = 4.13) in luminal breast cancer subtypes. Higher intratumoral collagen uniformity (reflecting fiber density) was linked to a threefold increased relapse risk (HR = 3.20), which remained significant in multivariate models intratumoral collagen organization showed no association [43]. Conversely, low expression of COL1A1 protein (indicating reduced collagen synthesis) was linked to inferior 3-year relapse-free survival (RFS; log-rank p = 0.046) in young breast cancer patients, though mRNA levels showed no association (p = 0.781) [10]. Another study identified the ln F/B ratio as prognostic marker in ER-positive, distant metastasis-positive patients treated with tamoxifen. Elevated ln F/B correlated with a 3.39-fold higher recurrence risk (HR = 3.39) [27]. Additionally, researchers have observed a positive but insignificant relationship between the alignment of collagen fibers at the tumor boundary of DCIS lesions and recurrence. Moreover, the TACS-3 pattern at the tumor boundary was not correlated with recurrence [50]. However, consistently high and low recurrence rates were associated with high and low TACS (1–8) scores, respectively [35]. Furthermore, various textural features of stromal collagen adjacent to DCIS are significantly associated with DCIS recurrence [51]. Another study introduced a new indicator called the collagen prognostic index. This index is computed by combining three distinct scores: collagen directionality score (rated 1–3), collagen solidity score (rated 1–3), and collagen dispersion score (rated 1–3). These scores are derived from the specific characteristics of collagen fibers, such as their directionality, dispersion, and density. Directionality and dispersion were measured in degrees (°), while solidity was measured in pixels. A high index was an independent predictor of poor prognosis for recurrence [22].
Cancer invasiveness
Five studies analyzed the correlation between breast cancer cell invasiveness and collagen features (Table 4). One study reported a negative relationship between collagen density and breast cancer invasiveness [52], whereas the other reported that the proportion of collagen fibers relative to tumor cells was significantly greater in invasive breast cancer cases than in in situ breast carcinomas [53], and there was no significant relationship between collagen quantity and invasiveness in the two assays [22, 53]. This discrepancy could be due to differences in the methods used to define and calculate the collagen content. Furthermore, one investigation revealed a statistically significant increase in the F/B ratio in ductal carcinoma in situ (DCIS) peritumoral stroma compared with invasive ductal carcinoma (IDC) tumoral stroma; however, this distinction between invasive and in situ carcinomas was not observed in patients with lobular carcinoma. Furthermore, an investigation indicated that the fibers of invasive breast cancers tend to be more perpendicular to the tumor boundary than those of in situ breast cancers [22]. Additionally, one study showed that in patients with lobular carcinoma, collagen alignment is more ordered in in situ breast cancers [28].
Stage
This section delves into the complex relationship between collagen features within invasive breast tumors and the established clinical staging systems. Variations in collagen organization, density, and specific molecular markers are explored in relation to overall stage, tumor size/local extent (T), lymph node involvement (N), and distant metastasis (M).
Overall stage
Several studies have explored the link between collagen features and the overall clinical stages of breast cancer (Table 5). In two investigations, higher intratumoral collagen uniformity, indicating more closely packed and evenly distributed fibers, along with increased collagen content, was associated with advanced overall stage [43, 47]. However, another study found no association between F/B ratio and stage [56]. Furthermore, one study revealed no significant correlation between tumor stage and collagen characteristics, including total collagen deposition, local collagen alignment and density, or the presence of TACS-3 and TACS-2+TACS-3 [44]. This was consistent with another study that reported no significant associations between tumor stage and the content or organization of collagen fibers [57].
T component of TNM staging system
In the TNM staging system, three papers examined the relationship between collagen features and the T category (Table 6). One study revealed that early T stage breast tissue had a significantly lower F/B ratio than healthy breast tissue. The F/B ratio was significantly elevated in advanced T stage invasive lobular carcinomas (ILCs), but not in IDC patients. These findings suggest that the microstructural characteristics of collagen fibrils during the progression of larger tumors can revert to their initial condition [54]. Another study aimed to assess the expression of collagen type I α1 (COL1A1), which encodes the major component of collagen I, via immunohistochemical (IHC) staining. In the context of T category, there was a gradual decrease in COL1A1 protein expression across T1, T2, and T3 tumors. However, in T4 tumors, COL1A1 expression increased compared to T3 tumors. Notably, the transcript expression of COL1A1 was not significantly correlated with T classification [10]. Furthermore, one study reported that collagen density showed a significant positive association with T stage, with T2 having a higher density than T1, while width and length of collagen fibers showed a significant negative association with T stage, with T1 having a greater width and length than T2. Straightness of collagen fibers showed no significant association with T stage [58].
N and M components of TNM staging system
Several studies have explored the connection between collagen characteristics and lymph node status (Table 7) or distance metastasis (Table 8).
Fifteen investigations analyzed the correlation between nodal status and collagen content and density via techniques such as SHG microscopy and staining with Sirius red, Masson’s trichrome, or trichrome Mallory (Fig. 5). Collagen content is described in various ways, including the collagen area, density, quantity, uniformity, fiber volume, deposition, intensity, content, and expression. Six of these studies indicated a positive relationship between tumor collagen content and lymph node metastasis [11, 21, 29, 43, 58, 59]. Conversely, nine studies reported no differences in tumor collagen intensity between nodal-positive and nodal-negative groups [10, 20, 29, 43, 44, 47, 57, 60]. Nevertheless, in one case, the authors indicated that statistical significance may be achieved with larger sample sizes [60]. Furthermore, in the context of distance metastasis, two investigations reported a negative correlation between collagen content and distant metastasis [20, 57].
Scattered plot summarizing the results of studies investigating the association between lymph node status of invasive breast cancer patients with content/density of collagen in the tumor microenvironment. Each dot relates to a reference showing the name of the first author (publishing year), and the collagen content/density-related variable. The size of the dots corresponds to the p-value, as shown in the guide box. Collagen Uniformity indicates whether fibers are closely packed and evenly distributed
Three studies have evaluated the relationship between lymph node or distance metastasis and the SHG F/B ratio of the tumor stroma, which depends on the fibril diameter and alignment of fibril packing in individual collagen fibers. One study noted that IDC patients with extensive lymph node involvement (N3, n = 9) showed a greater F/B ratio compared to those with less involvement (N0–N2, n = 132), though the small number of N3 cases might limit this finding. However, another study found no significant difference in the F/B ratio among ILC samples across various lymph node stages (N0–N3) or with distant metastasis (M0 vs. M1). Similarly, no significant F/B ratio differences were observed in IDC samples with varying distant metastatic statuses (M0 vs. M1). Evidence suggests that early in cancer progression, the stromal tissue around tumor cells may exhibit reduced collagen fibril thickness and a more disordered arrangement, potentially leading to changes in F/B ratio, with a possibility of reversion to a more original state as the tumor advances [54]. Furthermore, analysis revealed a significant relationship between an increased F/B ratio of the tumor and reduced incidence of metastasis in patients with ER-positive breast cancer [27, 56].
Several studies have focused on the relationship between collagen fiber organization and lymph node metastasis. Three assays revealed that lymph node-positive breast cancers have more organized collagen than lymph node-negative breast cancers [29, 57, 59]. Additionally, the fibers were straighter in lymph node-positive samples [58]. In contrast, one study reported that lymph node-negative breast cancers, compared to lymph node-positive ones, show significantly greater parallel alignment (as indicated by higher eccentricity) and fiber distribution uniformity (as indicated by higher energy). However, there was no significant difference in anisotropic alignment (as indicated by aspect ratio) between the two groups [29]. Another study found that lymph node-negative samples exhibited significantly greater length and width of collagen fibers than lymph node-positive cases. There was no significant difference in the angle of tumor growth between the two groups [58].
In one study, there was no statistically significant association between MFS length and the anisotropy of the overall orientation of collagen fibers, as measured by fast Fourier transform (FFT) image analysis, within the combined (ER-positive and ER-negative) sample set or in the ER-positive and ER-negative subpopulations, respectively. However, regarding the ER-positive group results, the negative relationship between MFS length and anisotropy (p = 0.058) may still be considered questionable and should be examined further [27]. Furthermore, two other studies did not find any association between fiber alignment and lymph node metastasis [43, 44]. Jiang et al. reported that taut, parallel collagen fibers around tumor nests were the most frequently observed collagen features in a lymph node-negative breast cancer group. The lymph node-positive group had more straightened and aligned collagen fibers, tending to be perpendicular to the tumor boundary [11]. Furthermore, there was a strong association between lymph node metastasis and TACS-1 and −4 (negative), and TACS-5 and −6 (positive). Moreover, they demonstrated that the likelihood of lymph node metastasis increased with increasing TACS (1–8) and TCMF scores [61]. Nevertheless, one investigation revealed no association between collagen alignment and the presence of TACS in different nodal statuses [44].
Grade
Sixteen studies investigated the association between breast cancer grade and collagen features (Table 9). Three investigations have been conducted on the microstructural properties of collagen fibers. One study showed that the F/B ratio of tumor samples of patients with grade I was higher than that of grade II, while there were no significant differences between grades III and I or II [54]; however, the small sample size, especially for grades I (n = 8) and III (n = 15), may not provide a statistically robust comparison. Another investigation that compared the median F/B ratio showed that there were no differences between different breast cancer grades and F/B ratios [27]. Tsafas et al. showed that, among diverse grades of breast cancer samples, the fiber anisotropy in grade I samples was the lowest, and the presence of fibrils with thin diameters or sparse packing structures in grade III samples was the highest [15].
Among the nine investigations focused on collagen density and tumor grade (Fig. 6), three studies showed no differences in collagen density and different tumor grades [24, 44, 46], while two studies showed a strong positive association between collagen density and tumor grade [22, 58], and four investigations indicated that collagen density in low-grade tumors is significantly higher than that in high-grade tumors [24, 55, 62], underscoring that the relationship between collagen density and tumor grade is not consistently defined across studies and may vary depending on factors such as tumor type or specific methodologies.
Scattered plot summarizing the results of studies investigating the association between tumor grade of breast cancer patients with density of collagen in the tumor microenvironment. Each dot relates to a reference showing the name of the first author (publishing year), the collagen density-related variable, and invasive/in situ (indicated where specified by the reference). The size of the dots corresponds to the p-value, as shown in the guide box. Collagen solidity, representing collagen density. 24x24, 48x48, 96x96 indicate counts of fibers within a 24, 48, or 96-micron square box centered on the fiber centroid. BC, breast cancer
Regarding collagen fiber alignment, two studies supported the correlation between tumor grade and increased collagen anisotropy [22, 44], while two other studies did not find any association [24, 32]. Furthermore, when examining the presence of TACS-3, one study found a significant relationship between tumor grade and TACS-3 as well as tumor grade and (TACS-2 + TACS-3) [44]. In contrast, three other studies found no differences [24, 50, 58]. Furthermore, while three studies showed no significant differences between tumor grade and collagen straightness [24, 32, 58], other studies indicated a relationship between higher tumor grade and reduced curviness, width, or length of collagen fibers [24, 58]. Adding to this complexity, there is evidence of a significant positive association between total collagen deposition and tumor grade in an invasive carcinoma study [44], while collagen dispersion was negatively associated with grade (I-II > III) [22]. Similarly, COL1A1 protein expression showed a negative association with grade [47], and distance to the nearest 2 fibers also decreased with increasing grade [24]. Conversely, total length, alignment of the nearest 4 fibers, box alignment (24x24 µm), distance to tumor/stromal boundary, COL1A1 mRNA expression, length and straightness in some studies showed no significant association with tumor grade [24, 32, 46, 47, 58].
In a recent study, researchers developed a novel approach for detecting tumor collagen in bright-field histology images of breast carcinoma, utilizing three distinct annotation methods: Low Detail Manual, where experts subjectively estimate collagen fiber directions with simple lines, resulting in less detailed annotations; High Detail Manual, which involves a more intricate manual annotation by experts to capture complex collagen patterns; and semi-automated, which merges automated thresholding with manual refinement for more objective annotations. The study found that high-grade samples were significantly associated with texture descriptors, such as energyFootnote 1, homogeneityFootnote 2, and informational measure of correlation 1Footnote 3; however, low-grade samples exhibited higher levels of fractal dimensionFootnote 4, number of endpointsFootnote 5, contrastFootnote 6, inertiaFootnote 7, averageFootnote 8, varianceFootnote 9, entropyFootnote 10, and magnitudeFootnote 11. These descriptors quantify the local variations in the gray-level co-occurrence matrix of the image. Furthermore, the study revealed that straightness and number of pixels in the mask, two key collagen features, varied with tumor grade, depending on the estimation method. High-detail manual annotations showed higher straightness in low-grade samples and more pixels in high-grade samples. In contrast, semi-automated methods indicated higher straightness and pixel count in high-grade and low-grade samples, respectively. This suggests that the annotation method can significantly influence the interpretation of collagen characteristics in breast cancer [25].
Receptor (PR, ER, HER2) status
Seven studies evaluated the relationship between hormone receptors in primary breast cancer cells and collagen structures (Table 10).
According to one study, tumors that were ER+ and PR+ had a higher collagen density than those that were negative. However, researchers found no association between HER2 and collagen content [10]. Another study reported that ER- tumors exhibited significantly higher total collagen deposition and local collagen alignment compared to ER+ tumors. Additionally, PR- tumors showed significantly higher local collagen density than PR+ tumors [44]. According to another experiment, there was no significant relationship between the hormone receptor status of cancer cells and the angle of the fibers relative to the boundary of the tumor. However, the assessment showed that low collagen density was detected more in ER-, PR-, or HER2+ and high dispersion was seen in tumors whose cancer cells were ER+ or PR+ [22].
For the determination of microstructural properties of breast cancer with different hormone receptor statuses, one study reported no association [56]. In contrast, another assay showed a higher F/B ratio in ER- and PR- -tumors than in ER+ and PR+ -tumors [27]. Finally, two articles delved into the relationship between hormone receptor status and the presence of different TACS. In this regard, while one study showed no significant relationship [44], another study confirmed that TACS-3 was higher in ER-, PR-, and HER2+ [50].
Therapy response
Five papers focused on the relationship between collagen features and therapeutic response (Table 11). According to one study, individuals with metastatic ER+ breast cancer, who were treated with tamoxifen, exhibited poorer progression-free survival (PSF) if their pretreated primary tumor had a higher F/B ratio than did those with a lower F/B ratio in the pretreated primary tumor. [27]. In HER2+ biopsy samples of pretreated primary tumors, a higher F/B ratio at the tumor-host interface has been linked to a better pathological response following neoadjuvant chemotherapy (NACT) and tumor resection, as determined by the residual cancer burden (RCB) classification. However, the F/B ratio was not significantly correlated with RCB class in TNBC biopsies [64]. Two studies reported a negative association between collagen content and neoadjuvant therapy response [47, 65]. One study revealed that fibers perpendicular to the tumor boundary tangents, fiber tortuosity, and thickness in the tumor nest borders of specimens from stage II-III HER2-positive breast cancer patients were negatively associated with pathological complete response (pCR) [66]. Another study revealed that the average fiber angle at the tumor boundary was not correlated with RCB class in HER2-positive or TNBC biopsy samples [64].
Discussion
This systematic review comprehensively examined the association between collagen architecture and breast cancer patient outcomes through September 2024. To understand this complex relationship, we explored multiple aspects of breast cancer, including survival, metastasis, treatment response, recurrence, tumor grade, stage, and receptor status. Many studies have employed advanced techniques such as SHG to study fibrillar collagen. Building on this foundation, we highlighted the potential of TACS scores and other computational models incorporating collagen features as prognostic tools. Additionally, by comparing the findings from multiple studies, we investigated the relationship between specific collagen components and distinct breast cancer characteristics, revealing potential clinical implications, conflicting results, and knowledge gaps.
While the majority of studies consistently revealed that specific collagen characteristics (including increased collagen quantity, a more aligned and anisotropic orientation, fibers arranged perpendicular to tumor boundaries, poorer microstructural properties (lower F/B SHG ratio), and straighter fibers) correlated with poorer prognosis in human breast cancer, substantial investigations found no association, and a smaller group of studies reported opposite results. Mechanistically, the increased stiffness of the collagen-rich ECM can impede immune cell infiltration [67] while facilitating cancer cell migration [68] and altering cellular metabolism to favor tumor cell invasion [69]. Conversely, collagen can play a role in antitumor immunity by providing a scaffold for antitumor cytokines [70]. However, induction of T-cell exhaustion through collagen-mediated leukocyte-associated immunoglobulin-like receptor 1 (LAIR-1) signaling complicates this picture [71]. Additionally, collagen may contribute to tumorigenesis by internalizing into cancer cells, providing energy and amino acids for their growth in a typically hypoxic and nutritionally stressed TME [72]. Thus, the overall impact of collagen on cancer progression depends on the delicate balance between these opposing effects within the complex tumor microenvironment. Additionally, aligned collagen fibers create highways that facilitate tumor cell migration and spread, especially when they are arranged perpendicular to the tumor boundary [73]. The collagen receptor, discoidin domain receptor 1 (DDR1), is crucial for establishing an aligned collagen architecture. The DDR1 ectodomain cleavages and remodels the collagen matrix into a parallel structure [74]. These aligned collagen fibers in perivascular regions and around tumor epithelial cells dictate the migratory trajectory of T cells and restrict them from entering tumor islets [41]. Anti-DDR1 antibodies disrupt collagen alignment and facilitate immune cell infiltration [74].
Based on the elegant studies by Burke, K. [27, 54] and Dessa, D. [56, 64], the poorer microstructural properties of collagen, as indicated by a lower F/B SHG ratio, are significantly associated with unfavorable prognosis in human breast cancer. This observation aligns with findings in pancreatic cancer, where increased matrix metalloproteinase (MMPs)-mediated cleaved COL-I (cCOL-I) has been linked to tumor progression [75]. This indicates that a low F/B SHG ratio (reflecting a predominance of more segmented and thinner collagen fibrils with more random inter-fibril packing structures) [12]) might indicate a higher proportion of cCOL-I within the tumor microenvironment. cCOL-I engages DDR1, initiating a signaling cascade that stimulates collagen macropinocytosis and mitochondrial biogenesis, enhances tumor bioenergetics, and ultimately promotes tumor growth and metastasis. Therefore, a lower F/B SHG ratio, as hypothesized by the higher abundance of cCOL-I, could contribute to a more aggressive tumor phenotype. Furthermore, increased metabolic activity driven by cCOL-I-mediated signaling might create a more permissive environment for tumor cell proliferation and invasion [75, 76].
However, the results of studies examining the relationship between tumor-stromal collagen characteristics and patient outcomes in breast cancer are remarkably contradictory, which underscores the complex and multifaceted role of collagen in breast cancer, highlighting the importance of context in determining the effect of collagen and the need for further investigation. To fully harness the potential of this research area, it is crucial to establish standardized methods for analyzing collagen features. By implementing standardized methodologies, researchers can better understand the influence of collagen on cancer progression. This will not only facilitate the integration of different studies but also pave the way for the development of cost-effective and valuable collagen-based prognostic tools for clinical use.
To fully understand the complex relationship between TME collagen and patient outcomes, future studies should address several critical factors.
First, ensuring an adequate sample size across all study groups is crucial. Smaller studies, as seen in some study groups of the reviewed research [27, 54] are more susceptible to chance variations and may not represent a broader patient population. Increasing sample sizes strengthens the validity of the findings and allows for more generalizable conclusions [77].
Second, standardizing methodologies for collagen analysis is highly beneficial. This encompasses various aspects, such as tissue processing and staining approaches, detection techniques, and software and calculation methods used for image analysis [78]. For instance, various steps in the processing and mounting of sample tissues may affect the F/B ratio, emphasizing the need for standardized protocols [64]. The current lack of standardization creates challenges when researchers compare the findings of different studies. By establishing standardized techniques, future research can seamlessly integrate data and pave the way for more conclusive results.
Third, addressing the inconsistencies in the definition and measurement of collagen features is critical. Our review highlighted discrepancies, such as the term “uniformity” encompassing fiber density or distribution in various studies [43, 53]. Similar inconsistencies exist for other features, such as alignment [23, 44, 50, 59] and density [22,23,24,25, 43, 44, 46, 52, 53, 55, 59, 62]. This lack of standardization hinders our ability to compare findings and ultimately to understand how collagen influences patient outcomes. Establishing consistent definitions and measurement techniques is the key to unlocking the full potential of this research area.
Fourth, determining the spatial distribution of collagen fibers relative to the tumor is critical. In situ breast cancer primarily features peritumoral collagen, whereas invasive breast cancer exhibits both peritumoral and intratumoral collagen, with collagen often interspersed among cancer cells at the invasive front [34, 35]. Notably, distinct collagen characteristics within these spatial contexts may differentially affect disease outcomes. For instance, while total collagen deposition in invasive breast cancer did not correlate with survival, local collagen density, measured as collagen content within a 50-pixel radius from the tumor-normal boundary, was significantly associated with a better prognosis [44]. This discrepancy, potentially attributable to variations in the collagen region of interest and methodological approaches, suggests that collagen quantity alone may not fully capture its influence on tumor behavior.
Fifth, the clinical heterogeneity of breast cancer, including receptor expression, tumor grade, invasiveness, origin, treatment history, and age, is crucial for interpreting collagen-related findings. The prognostic value of collagen is context-dependent, as evidenced by disparate results across studies. For instance, while an elevated F/B ratio correlates with favorable outcomes in ER+ breast cancer patients, its prognostic significance in other subpopulations remains equivocal [27]. Given that COL1A1 expression is often higher in ER+ tumors [10, 45], potentially influenced by estrogen, and F/B (reflecting fibril diameter and packing) was observed to be lower in ER+ patients [27], it could be hypothesized that while estrogen may promote collagen production, it might also contribute to disrupted fibrillogenesis within the malignant context. Meanwile, in ER+ group, patients exhibiting a higher F/B ratio might represent a subgroup where fibril diameter and packing are relatively preserved, potentially counteracting some of the adverse effects associated with estrogen-driven ECM alterations and leading to improved outcomes. One study reported a positive association between F/B ratio in ductal carcinoma and N stage, with N3 tumors showing a higher F/B than N0-N2. This is particularly interesting as N3 represents more extensive lymph node metastases. The authors speculate this “normalization” of F/B in highly metastatic tumors might facilitate further spread. Intensity threshold-based F/B at the tumor-host interface is correlated with the RCB class in HER2+ biopsy samples, but this correlation was not detected in TNBC samples [64]. Moreover, a notable study reported a higher F/B ratio in DCIS peritumoral stroma collagen than in collagen fibers within the IDC tumor, but this disparity was absent in lobular carcinoma [54]. Additionally, variations in ECM remodeling from early-stage to late-stage disease and differences in mutation profiles further modulate collagen characteristics, contributing to the complex and sometimes conflicting prognostic associations observed across studies [34]. For example, TP53 mutations are often associated with increased MMP activity, potentially leading to a more fragmented collagen network [79], while PIK3 CA mutations might enhance growth factor signaling that promotes collagen deposition and alignment [80]. This emphasis on the complex nature of the relationship between tumor cells and collagen characteristics and underscores the need for more comprehensive and well-designed research. Given the importance of these studies, the authors have provided all available patient data of included papers in Supplementary File 2. These detailed patient data can help researchers develop more accurate prognostic models and targeted therapies based on collagen characteristics.
Conclusion
In conclusion, this review highlights the complex and often contradictory roles of collagen in breast cancer. Although promising, the field requires standardized methodologies and larger studies to establish collagen as a reliable prognostic or predictive biomarker. Given its potential impact on tumor biology, exploring the role of collagen in other cancers is warranted.
Data availability
No datasets were generated or analysed during the current study.
Notes
Energy is a texture descriptor derived from the spatial gray-level co-occurrence matrix that measures the uniformity of pixel pair repetitions. High energy values indicate more repetitive patterns and textures in the image.
Homogeneity assesses the closeness of the distribution of elements in the spatial gray-level co-occurrence matrix to the matrix diagonal. Higher homogeneity values suggest less contrast and a more uniform texture across the image.
Informational Measure of Correlation 1 measures the correlation of a pixel to its neighbor over the whole image. It’s based on the entropy of the gray-level co-occurrence matrix and the entropy of the image, providing insight into the complexity and the amount of information in the image’s texture.
Fractal dimension indicates the complexity of the collagen framework, with higher values suggesting a more intricate and detailed pattern.
A higher number of endpoints typically indicates a more complex and denser collagen network.
A higher contrast value indicates greater disparity in intensity between neighboring pixels, which often corresponds to a more pronounced texture in the image.
Inertia is a method of examining the spatial relationship between pixels. The higher the inertia value, the greater the local contrast and the more complex the texture pattern in the image.
The average intensity value of pixels in the gray-level co-occurrence matrix which reflects the average level of gray in the texture.
Variance measures the spread of the intensity values around the average, which indicates the amount of variation or dispersion from the average.
Entropy is a measure of randomness or complexity in the image texture. High entropy values indicate more complex and less predictable textures.
Magnitude summarizes the collagen framework’s directional properties.
Abbreviations
- BCSS:
-
Breast cancer-specific survival
- CSS:
-
Cancer-specific survival
- cCOL-I:
-
cleaved COL-I
- CSTB:
-
Collagen signatures at tumor boundary
- CIs:
-
Confidence intervals
- COL1A1:
-
Collagen type I α1
- CVF:
-
Collagen volume fraction
- DDR1:
-
Discoidin domain receptor 1
- DFS:
-
Disease-free survival
- DSS:
-
Disease-specific survival
- DCIS:
-
Ductal carcinoma in situ
- ECM:
-
Extracellular matrix
- F/B:
-
Forward-emitted to backward-emitted
- HRs:
-
Hazard ratios
- H&E:
-
Hematoxylin and eosin
- IHC:
-
Immunohistochemistry
- IGNN:
-
Intratumor graph neural network
- IDC:
-
Invasive ductal carcinoma
- LAIR-1:
-
Leukocyte-associated immunoglobulin-like receptor 1
- MMP:
-
Matrix metalloproteinase
- MFS:
-
Metastasis-free survival
- NACT:
-
Neoadjuvant chemotherapy
- NOS:
-
Newcastle‒Ottawa scale
- OS:
-
Overall survival
- pCR:
-
Pathological complete response
- PSF:
-
Progression-free survival
- RFS:
-
Relapse-free survival
- RCB:
-
Residual cancer burden
- SHG:
-
Second harmonic generation
- TCMF:
-
TACS corresponding microscopic features
- TME:
-
Tumor microenvironment
- TACS:
-
Tumor-associated collagen signature
- WOS:
-
Web of Science
References
Bray F, et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6):394–424.
Anderson NM, Simon MC. The tumor microenvironment. Curr Biol. 2020;30(16):R921–5.
Li JJ, Tsang JY, Tse GM. Tumor microenvironment in breast cancer—updates on therapeutic implications and pathologic assessment. Cancers. 2021;13(16):4233.
Lepucki A, et al. The role of extracellular matrix proteins in breast cancer. J Clin Med. 2022;11(5):1250.
Martinez J, Smith PC. The dynamic interaction between extracellular matrix remodeling and breast tumor progression. Cells. 2021;10(5):1046.
Lo Buglio G, et al. The multifaced role of collagen in cancer development and progression. Inte J Mol Sci. 2024;25(24):13523.
Zhang Q, et al. Collagen code in tumor microenvironment: functions, molecular mechanisms, and therapeutic implications. Biomed Pharmacother. 2023;166:115390.
Cambi A, Ventre M. Collagen-based biomimetic systems to study the biophysical tumour microenvironment. Cancers (Basel). 2022;14(23):5939.
Baldari S, et al. Strategies for efficient targeting of tumor collagen for cancer therapy. Cancers. 2022;14(19):4706.
Chekhun V, et al. Features of COL1A1 expression in breast cancer tissue of young patients. Exp Oncol. 2023;45(3):351–63.
Jiang Y, et al. Collagen fiber features and COL1A1: are they associated with elastic parameters in breast lesions, and can COL1A1 predict axillary lymph node metastasis? BMC Cancer. 2022;22(1):1004.
Williams RM, Zipfel WR, Webb WW. Interpreting second-harmonic generation images of collagen I fibrils. Biophys J. 2005;88(2):1377–86.
Golaraei A, et al. Polarimetric second-harmonic generation microscopy of the hierarchical structure of collagen in stage I-III non-small cell lung carcinoma. Biomed Opt Express. 2020;11(4):1851–63.
Golaraei A, et al. Changes of collagen ultrastructure in breast cancer tissue determined by second-harmonic generation double Stokes-Mueller polarimetric microscopy. Biomed Opt Express. 2016;7(10):4054–68.
Tsafas V, et al. Polarization-dependent second-harmonic generation for collagen-based differentiation of breast cancer samples. J Biophotonics. 2020;13(10):e202000180.
Oldenbourg R. Polarized light microscopy: principles and practice. Cold Spring Harb Protoc. 2013;2013(11):pdb.top078600.
Majeed H, et al. Quantitative histopathology of stained tissues using Color Spatial Light Interference Microscopy (cSLIM). Sci Rep. 2019;9:14.
Ding H, et al. Measuring the scattering parameters of tissues from quantitative phase imaging of thin slices. Opt Lett. 2011;36(12):2281–3.
Dey P. Basic and advanced laboratory techniques in histopathology and cytology. Singapore: Springer; 2018.
Sun C, et al. Quantitative measurement of breast carcinoma fibrosis for the prediction in the risk of bone metastasis. Am J Transl Res. 2018;10(6):1852–9.
Wang L, et al. Is macroscopic tumor stiffness on strain elastography related to the axillary nodal status in T1 stage ductal invasive breast cancer? Int J Clin Exp Med. 2016;9(2):3371–9.
Toss MS, et al. Geometric characteristics of collagen have independent prognostic significance in breast ductal carcinoma in situ: an image analysis study. Mod Pathol. 2019;32(10):1473–85.
Bredfeldt JS, et al. Automated quantification of aligned collagen for human breast carcinoma prognosis. J Pathol Inform. 2014;5(1):28.
Sprague BL, et al. Collagen organization in relation to ductal carcinoma in situ pathology and outcomes. Cancer Epidemiol Biomarkers Prev. 2021;30(1):80–8.
Morkunas M, et al. Tumor collagen framework from bright-field histology images predicts overall survival of breast carcinoma patients. Sci Rep. 2021;11(1):15474.
Xi G, et al. Computer-assisted quantification of tumor-associated collagen signatures to improve the prognosis prediction of breast cancer. BMC Med. 2021a;19(1):273.
Burke K, et al. Using second harmonic generation to predict patient outcome in solid tumors. BMC Cancer. 2015;15:929.
Shen TF, et al. Monitoring the progression of lobular breast carcinoma using multiphoton microscopy. Laser Physics Letters. 2019;16(10):9.
Kakkad SM, et al. Collagen i fiber density increases in lymph node positive breast cancers: Pilot study. J Biomed Opt. 2012;17(11):116017.
Nguyen TL, et al. Quantitative phase imaging: recent advances and expanding potential in biomedicine. ACS Nano. 2022;16(8):11516–44.
Chen Y, Yu Q, Xu CB. A convenient method for quantifying collagen fibers in atherosclerotic lesions by ImageJ software. 2017.Chen, Ying, Qiuhao Yu, and Cang-Bao Xu. A convenient method for quantifying collagen fibers in atherosclerotic lesions by ImageJ software. Int J Clin Exp Med. 2017;1(10):14904–10.
Brabrand A, et al. Alterations in collagen fibre patterns in breast cancer. A premise for tumour invasiveness? APMIS. 2015;123(1):1–8.
Perry SW, et al. Stromal matrix metalloprotease-13 knockout alters Collagen I structure at the tumor-host interface and increases lung metastasis of C57BL/6 syngeneic E0771 mammary tumor cells. BMC Cancer. 2013;13(1):411.
Provenzano PP, et al. Collagen reorganization at the tumor-stromal interface facilitates local invasion. BMC Med. 2006;4(1):38.
Xi GQ, et al. Large-scale tumor-associated collagen signatures identify high-risk breast cancer patients. Theranostics. 2021b;11(7):3229–43.
Qiu L, et al. Intratumor graph neural network recovers hidden prognostic value of multi-biomarker spatial heterogeneity. Nat Commun. 2022;13(1):4250.
Ren Y, et al. Improved quantitative fibrosis indices reveal diverse survivals of triple negative breast cancer patients. Cancer Res. 2023;83(5):P6-04-11.
Provenzano PP, et al. Collagen reorganization at the tumor-stromal interface facilitates local invasion. BMC Med. 2006;4:1–15.
Munn Z, et al. What kind of systematic review should I conduct? A proposed typology and guidance for systematic reviewers in the medical and health sciences. BMC Med Res Methodol. 2018;18(1):5.
Page MJ, et al. PRISMA 2020 explanation and elaboration: updated guidance and exemplars for reporting systematic reviews. BMJ. 2021;372:n160.
Salmon H, et al. Matrix architecture defines the preferential localization and migration of T cells into the stroma of human lung tumors. J Clin Invest. 2012;122(3):899–910.
Stang A. Critical evaluation of the Newcastle-Ottawa scale for the assessment of the quality of nonrandomized studies in meta-analyses. Eur J Epidemiol. 2010;25(9):603–5.
Natal RA, et al. Collagen analysis by second-harmonic generation microscopy predicts outcome of luminal breast cancer. Tumor Biol. 2018;40(4):1010428318770953.
Esbona K, et al. The presence of cyclooxygenase 2, tumor-associated macrophages, and collagen alignment as prognostic markers for invasive breast carcinoma patients. Am J Pathol. 2018;188(3):559–73.
Jansson M, et al. Stromal type I collagen in breast cancer: correlation to prognostic biomarkers and prediction of chemotherapy response. Clin Breast Cancer. 2024;24(5):e360-e369.e4.
Gole L, et al. Quantitative stain-free imaging and digital profiling of collagen structure reveal diverse survival of triple negative breast cancer patients. Breast Cancer Res. 2020;22(1):13.
Hacking SM, et al. Whole slide image features predict pathologic complete response and poor clinical outcomes in triple-negative breast cancer. Pathol Res Pract. 2023;246:154476.
Conklin MW, et al. Aligned collagen is a prognostic signature for survival in human breast carcinoma. Am J Pathol. 2011;178(3):1221–32.
Huang, Xingxin, Fangmeng Fu, Wenhui Guo, Deyong Kang, Xiahui Han, Liqin Zheng, Zhenlin Zhan, Chen Wang, Qinghua Zhang, Shuhua Wang, and Shusen Xu. Prognostic significance of collagen signatures at breast tumor boundary obtained by combining multiphoton imaging and imaging analysis. Cell Oncol. 2024;47(1):69–80.
Conklin MW, et al. Collagen alignment as a predictor of recurrence after ductal carcinoma in situ. Cancer Epidemiol Biomark Prev. 2018;27(2):138–45.
Upadhaya T, et al. Prediction of breast ductal carcinoma in situ recurrence using histomics analysis of stromal collagen from second-harmonic generation and hematoxylin and eosin stain-based images. Cancer Res. 2022;82(4):P1-02-16.
Wu Y, et al. Monitoring the progression from intraductal carcinoma to invasive ductal carcinoma based on multiphoton microscopy. J Biomed Opt. 2015;20(9):096007.
Gubarkova EV, et al. Multiphoton tomography in differentiation of morphological and molecular subtypes of breast cancer: a quantitative analysis. J Biophotonics. 2021;14(5):e202000471.
Burke K, Tang P, Brown E. Second harmonic generation reveals matrix alterations during breast tumor progression. J Biomed Opt. 2013;18(3):31106.
Wu Y, et al. Monitoring morphological alterations during invasive ductal breast carcinoma progression using multiphoton microscopy. Lasers Med Sci. 2015;30(3):1109–15.
Desa DE, et al. Intratumoral heterogeneity of second-harmonic generation scattering from tumor collagen and its effects on metastatic risk prediction. BMC Cancer. 2020;20(1):1217.
Hall G, et al. SHG fiberscopy assessment of collagen morphology and its potential for breast cancer optical histology. IEEE Trans Biomed Eng. 2024;71(8):2414–20.
Lukianova N, et al. Development of an algorithm for biomedical image analysis of the spatial organization of collagen in breast cancer tissue of patients with different clinical status. FEBS Open Bio. 2024;14(4):675–86.
Gao H, et al. 3D extracellular matrix regulates the activity of T cells and cancer associated fibroblasts in breast cancer. Front Oncol. 2021;11:764204.
Xue XW, et al. Kindlin-2 could influence breast nodule elasticity and improve lymph node metastasis in invasive breast cancer. Sci Rep. 2017;7:10.
Fang Y, et al. Collagen signature as a novel biomarker to predict axillary lymph node metastasis in breast cancer using multiphoton microscopy. J Biophotonics. 2022;15(6):e202100365.
Chen Z, et al. Label-free identification of early stages of breast ductal carcinoma via multiphoton microscopy. Scanning. 2020;2020:8.
Chen X, et al. Monitoring changes of collagen fibers surrounding breast ductal carcinoma in situ using multiphoton microscopy. In: Proceedings of SPIE - The International Society for Optical Engineering. 2023.
Desa DE, et al. Second-harmonic generation directionality is associated with neoadjuvant chemotherapy response in breast cancer core needle biopsies. J Biomed Opt. 2019;24(8):1–9.
Huang JX, et al. Comparing shear wave elastography of breast tumors and axillary nodes in the axillary assessment after neoadjuvant chemotherapy in patients with node-positive breast cancer. Radiol Med. 2024;129(8):1143–55.
Nguyen TH, et al. Quantitative analysis of fiber-level collagen features in H&E whole-slide images predicts neoadjuvant therapy response in patients with HER2+ breast cancer. Cancer Res. 2023;83(5):P5-02-09.
Kuczek DE, et al. Collagen density regulates the activity of tumor-infiltrating T cells. J Immunother Cancer. 2019;7(1):68.
Wu Y, et al. Matrix-driven changes in metabolism support cytoskeletal activity to promote cell migration. Biophys J. 2021;120(9):1705–17.
Morris BA, et al. Collagen matrix density drives the metabolic shift in breast cancer cells. EBioMedicine. 2016;13:146–56.
Momin N, et al. Anchoring of intratumorally administered cytokines to collagen safely potentiates systemic cancer immunotherapy. Sci Transl Med. 2019;11(498):eaaw2614.
Peng DH, et al. Collagen promotes anti-PD-1/PD-L1 resistance in cancer through LAIR1-dependent CD8(+) T cell exhaustion. Nat Commun. 2020;11(1):4520.
Olivares O, et al. Collagen-derived proline promotes pancreatic ductal adenocarcinoma cell survival under nutrient limited conditions. Nat Commun. 2017;8:16031.
Provenzano PP, et al. Collagen density promotes mammary tumor initiation and progression. BMC Med. 2008;6(1):11.
Sun X, et al. Tumour DDR1 promotes collagen fibre alignment to instigate immune exclusion. Nature. 2021;599(7886):673–8.
Su H, et al. Collagenolysis-dependent DDR1 signalling dictates pancreatic cancer outcome. Nature. 2022;610(7931):366–72.
Su H, Karin M. Collagen architecture and signaling orchestrate cancer development. Trends Cancer. 2023;9(9):764–73.
Banerjee A, Chaudhury S. Statistics without tears: populations and samples. Ind Psychiatry J. 2010;19(1):60–5.
Liu, Yuming, Adib Keikhosravi, Guneet S. Mehta, Cole R. Drifka, and Kevin W. Eliceiri. Methods for quantifying fibrillar collagen alignment. In Fibrosis: Methods and Protocols, edited by Paolo Bonaldo and Mauricio Rojas. New York: Springer; 2017. p. 429–51.
Zhu G, et al. Mutant p53 in cancer progression and targeted therapies. Front Oncol. 2020;10:595187.
An Y, et al. Cdh1 and Pik3ca mutations cooperate to induce immune-related invasive lobular carcinoma of the breast. Cell Rep. 2018;25(3):702-714.e6.
Acknowledgements
Not applicable.
Funding
This study was funded by a grant from the Iran University of Medical Sciences (Grant No: 1403-2-99-30621).
Author information
Authors and Affiliations
Contributions
R.G. and S.H. conceived and designed the study. F.T. provided the search syntax. F.T. conducted the database search. S.H., F.T., S.S., and S.D. screened and selected data. F.T. assessed data quality. S.H. extracted data and prepared the initial draft of the manuscript, while F.T. drafted the Method section. S.H. prepared and illustrated Figures and Tables. F.K. and B.K. provided expert opinions. Z.M. and R.G. reviewed and edited the final manuscript. All authors contributed to data interpretation and critical revision of the manuscript for important intellectual content.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
This systematic review was approved by the Ethics Committee of the Iran University of Medical Sciences (IR.IUMS.REC.1403.545). As this review relied solely on publicly accessible published data, formal consent to participate was not required.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Heydari, S., Tajik, F., Safaei, S. et al. The association between tumor-stromal collagen features and the clinical outcomes of patients with breast cancer: a systematic review. Breast Cancer Res 27, 69 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13058-025-02017-6
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13058-025-02017-6