Abstract
Interdisciplinary research is a driving force of transformative and innovative science, yet it remains unclear how early-career scientists pursue interdisciplinary research paths. Analyzing data from 675,135 doctoral theses of U.S. Ph.D. graduates who graduated from 1950 to 2016, we study the development of interdisciplinary doctoral theses in the five scientific domains of behavioral sciences, biological sciences, engineering, health and medical sciences, and mathematical and physical sciences. We propose an indicator to measure the degree of interdisciplinarity embedded in the doctoral research by employing co-occurrence matrices of subjects assigned to doctoral theses in the ProQuest Dissertations & Theses Database. This study finds that interdisciplinary doctoral theses have exhibited a growing trend across different scientific domains, and universities of varying research intensity. Since the 1990s, interdisciplinary research has played a dominant role in doctoral theses within the five scientific domains. The results of multivariate regression models suggest persistent gender disparities in the interdisciplinarity level of doctoral theses. Specifically, male-authored doctoral theses demonstrate a higher level of interdisciplinarity than female-authored doctoral theses. In addition, this study suggests that being supervised by female advisors may amplify gender disparities in the interdisciplinarity level of their students’ doctoral theses. The findings indicate the potential underrepresentation of female scientists in pursuing interdisciplinary research at the early stages of their careers. Given that funding agencies have promoted interdisciplinary research and its potential benefits, the lower level of interdisciplinarity in the doctoral theses of female students may hinder their career advancement. Furthermore, our findings indicate that offering increased support to female faculty members may not only directly benefit their career development but also hold considerable significance in promoting future generations of female scientists. The findings of this study have important policy implications for advancing the careers of female scientists.
Similar content being viewed by others
Introduction
Interdisciplinary research (hereafter IDR) has been widely recognized as a catalyst for transformative and innovative science, promoting national competitiveness. In recent decades, interdisciplinary approaches have increasingly been applied in science, education, and policy domains to tackle emerging research problems, such as public health issues, climate change, and biodiversity loss (de Macedo et al., 2023; Puppim de Oliveira and Qian, 2023; Sentell et al., 2020). IDR is often considered a research mode that necessitates an integration of concepts, techniques, theories, and/or data from two or more disciplines or specialized areas of knowledge (National Academy of Sciences, 2004). The theory of recombinant novelty suggests that the integration of knowledge elements from distant sources is more likely to bring about innovative outcomes, which is also true at the disciplinary level (Fleming, 2001; Liu et al., 2022; Weitzman, 1998). Since the 1960s, interdisciplinary issues have become a significant topic in the discourse on knowledge creation and science policy. As important ideas often flourish beyond the confines of a single discipline, policy instruments in many countries have encouraged IDR with high expectations for its benefits. For instance, in 2020, the National Natural Science Foundation of China (NSFC) launched a new division, i.e., the Department of Interdisciplinary Science, to push forward IDR to meet national strategic needs.
Given the potential benefits of IDR and the ongoing policy push, the capacity to perform IDR is becoming increasingly critical for scientists. However, there is surprisingly little knowledge about how early-career scientists (hereafter ECSs) are adapting to this trend toward interdisciplinarity, and whether gender plays a role in this process. The “publish or perish” imperative puts pressure on ECSs to publish frequently to secure a job in a competitive market. The research strategies they adopt may yield different outcomes for their survival and success in academia. To ensure that they remain in academia, ECSs should specialize and gain recognition within specific fields (Heiberger et al., 2021), while also becoming independent scientists and providing innovative perspectives.
Performing IDR can be particularly challenging for ECSs due to institutional, cultural, and practical barriers that make it risky. Despite the potential benefits of integrating knowledge from diverse disciplines, pursuing IDR is often a difficult endeavor. Empirical evidence suggests that IDR that requires additional commitment and effort can be difficult to produce, and may result in research outcomes of lower quality (Kovacs and Lex, 2012), lead to lower productivity (Leahey et al., 2017), create ambiguous identity, and receive less recognition (Hsu et al., 2009). Therefore, IDR entails high risks and uncertainties for ECSs, especially for female ECSs who are underrepresented in STEM fields and face systematic barriers that impede their careers in the research system (Huang et al., 2020). Female scientists exhibit disadvantages including reduced access to research funding (Larivière et al., 2011), stereotypes about their roles and abilities (Eagly et al., 2020), challenges in collaboration and leadership (Liu et al., 2022), and difficulties in balancing childcare responsibilities with the demands of their careers (Fox, 2005). As a result, previous studies find that female scientists exhibit inferior research performance and a greater likelihood of leaving academia than their male counterparts.
Given the riskier path of IDR and their vulnerable positions, female scientists tend to be more cautious or conservative in their research strategies than their male colleagues. This caution originates from their fear of being criticized (Etzkowitz, Kemelgor, and Uzzi, 2000), making mistakes (Correll, 1997) or acting impulsively (Rier, 2003). Besides, literature in sociology, economics and psychology points out the gender difference in response to risk. Empirical and experimental evidence suggests that women are less likely than men to engage in risky behaviors (Charness and Gneezy, 2012; Eckel and Grossman, 2008). Therefore, we presume that female ECSs may adopt safer research strategies and be more hesitant to engage in IDR.
The investigation of the interdisciplinarity of ECSs’ research is essential, particularly concerning gender and mentorship. Despite a few exceptions (Sugimoto et al., 2011; Unger et al., 2022), there is a lack of rigorous investigations of ECSs’ preferences for IDR. Studying IDR among ECSs can provide insights into how they embrace the practices and policies that favor IDR by adopting different research strategies, and how intellectual preferences for IDR are influenced by gender and mentorship. Understanding gender differences in engaging in IDR can help reveal the driving forces and mechanisms underlying the productivity puzzle and the glass ceiling in science education and research strategies.
Unlike a few existing studies that investigate ECSs’ publications (Lee, 2019; Zhang and Yu, 2020), such as journal articles or conference papers, we focus on doctoral theses to uncover their engagement in IDR. Doctoral theses are considered a crucial requirement for independent research contribution (Donner, 2021) and the most important research output of junior scientists (Vera-Baceta et al., 2019). Doctoral theses can reflect a part of the academic culture, thinking, learning, and writing skills of students, which allows us to reflect on ECSs’ participation in IDR, and helps to provide a clearer picture of ECSs’ interdisciplinary research capacity.
The degree of interdisciplinarity can vary across scientific domains. Previous literature has suggested that humanities tend to have a lower level of interdisciplinarity, while biomedicine, physics, and chemistry exhibit higher levels, compared to other domains (Morillo et al., 2003). Given the severe underrepresentation of female scientists in hard science and the disciplinary variations in how scientists incorporate knowledge from diverse domains (Huang et al., 2020; Liu et al., 2018; Liu et al., 2017), this study specifically focuses on five representative domains within the hard sciences. These domains include behavioral sciences, biological sciences, engineering, health and medical sciences, and mathematical and physical sciences.
Three key factors drive our investigation into scientists from the United States (hereafter the U.S.). First, the U.S. maintains a position at the forefront of science and technology, making it a significant contributor to global scientific advancements (White, 2019). The high-quality doctorate education system in this country plays a crucial role in building and sustaining its national scientific capacity (Sarrico, 2022). As a result, doctoral graduates from the U.S. serve as representative samples for investigating ECSs. Second, the U.S. serves as an example of strong interdisciplinary policies for higher education and research (National Academy of Sciences, 2004). The ECSs from the U.S. often receive training in IDR and might exhibit a strong motivation to perform IDR in their theses. In addition, gender differences are notable in U.S. academia, with systematic barriers affecting female scientists within the research system (Sá et al., 2020). This allows us to capture gender disparities in the research strategies employed by ECSs, thus shedding light on potential gender-related differences regarding engagement in IDR within their doctoral theses.
This study investigates the doctoral theses of 675,135 Ph.D. students who completed their doctorates at universities in the U.S. between 1950 and 2016 across five scientific domains: behavioral sciences, biological sciences, engineering, health and medical sciences, and mathematical and physical sciences. We construct an indicator to measure the degree of interdisciplinarity embedded in the doctoral research by utilizing co-occurrence matrices of subjects assigned to doctoral theses in the ProQuest Dissertations & Theses Database. To provide a comprehensive understanding of ECSs’ engagement in IDR, we propose the following research questions:
RQ1: What is the prevalence of interdisciplinary research among early-career scientists? To address RQ1, our analysis focuses on examining the temporal evolution of the interdisciplinarity indicator and its distributions across five scientific domains, and universities of different research intensity.
RQ2: How does gender influence ECSs’ engagement in IDR? This study explores the gender differences in ECSs’ engagement in IDR with a focus on temporal changes, disciplinary disparities, and differences across universities of varying research quality.
RQ3: Is the gender of advisors correlated with students’ participation in IDR? This study examines if the gender of advisors and the gender pairing of students and advisors are related to gender differences in this direction.
The remainder of this study is organized as follows. The next section reviews the related work and proposes the research questions. The section of “Data and methods” introduces the details of data and empirical approaches. In the section of “Results”, the results are presented. The last section presents a discussion of the findings and implications for science policies.
Literature review and research questions
Early-career scientists and interdisciplinary research
Existing literature suggests that IDR can benefit scientists’ career advancement in the mid to long term, as it has the potential to foster scientific breakthroughs and lead to greater impact (Chen et al., 2015; Larivière et al., 2015; Okamura, 2019). Extensive empirical studies suggest that novel inventions often emerge from the combination of disparate knowledge domains (Fleming, 2001; Schilling and Green, 2011). Recent studies have shown that interdisciplinary publications tend to attract more citations (Zhang et al., 2021) and gain higher societal visibility (D’Este and Robinson-García, 2023). An increase in the effective number of disciplines is associated with a 20% increase in research impact at the article level (Okamura, 2019). Due to its potential to advance scientific frontiers, science policy, especially funding policy, and higher education systems in numerous countries actively promote IDR, and encourage interdisciplinary collaboration (James Jacob, 2015; Lyall et al., 2013; Rylance, 2015; Spelt et al., 2009). For instance, since the mid-1990s, federal research funding agencies have exerted growing pressure on the research community to embrace IDR (Rhoten and Pfirman, 2007).
However, some other researchers point out a non-linear relationship between IDR and scientific impacts, and the potential citation penalty of IDR due to a lack of recognition or perceived lower quality (Jacobs and Frickel, 2009; Leahey et al., 2017; Liu et al., 2017). Analyzing all articles that are indexed in the Web of Science (hereafter WoS) in 2000, a study finds an inverted U-shaped correlation between interdisciplinarity and papers’ scientific impacts (Larivière and Gingras, 2010). Some studies find that IDR receives fewer citations in natural and health sciences, compared to disciplinary research (Levitt and Thelwall, 2008), and different components of interdisciplinarity can have varying effects on short-term versus long-term citations (Wang et al., 2015). In addition, pursuing IDR requires significant time and effort to integrate knowledge from diverse disciplines, and such research may face difficulties in terms of publication and recognition compared to more traditional research (Blackmore and Kandiko, 2011). ECSs face the pressure of productivity evaluation through metrics like the number of papers published, citations, and journal impact factors (Nicholas et al., 2018). Therefore, pursuing IDR may lead to temporary productivity obstacles and challenges in gaining recognition, potentially impacting ECSs’ career advancement (Frodeman et al., 2017).
Involvement in IDR presents a mix of benefits and challenges for scientists. On the positive side, IDR has the potential to contribute to long-term career growth and foster innovative breakthroughs. However, pursuing an IDR path may temporarily hamper productivity and introduce uncertainties in securing academic positions of ECSs. In contrast to scientists at later career stages, ECSs might exhibit greater reluctance to engage in IDR due to the difficulties and challenges they face when pursuing this path. However, as the importance of IDR continues to grow and science policies advocate for it, we expect ECSs to increasingly involve themselves in IDR over time. Despite ongoing discussions regarding ECSs’ attitudes toward IDR, the historical evolution of IDR among ECSs remains unclear.
Gender and interdisciplinary research
Gender refers to the societal, cultural, and behavioral expectations, roles, and identities imposed on individuals based on their perceived sex (Eagly, 2013). It is important to differentiate gender from sex, primarily concerning the biological distinctions between males and females. Given the research focus of this study on ECSs’ engagement in IDR, we prioritize examining gender as it encompasses a broader range of social, psychological, and environmental factors that influence ECSs’ decision-making and engagement in IDR.
The discourse surrounding the correlation between gender and IDR is shaped by various perspectives including gender-related biological origins and psychological characteristics, cultural influences, and gender disparities in the research system. However, the existing literature presents contradictory findings regarding gender differences in IDR.
The literature on gender-related biological and psychological origins suggests that female scientists may be more inclined to engage in IDR. The neuroanatomical structure of human brains is one potential explanatory factor, as research findings suggest that females are better at assimilating diverse forms of information and making connections between ideas (Haier et al., 2005). Feminist science studies have long theorized that women may be less constrained by the norms of science (Harding, 1986). Moreover, research in science studies suggests that gender differences in selecting research problems. For instance, female scientists tend to be more oriented toward advancing knowledge that meets human needs, in contrast to males, who are more interested in conventional scientific motivations (Harding, 1992). Empirical studies, including surveys and analyses of collaboration patterns, also suggest that women scientists are more likely to engage in IDR activities and collaborations. For example, female graduate students are found to spend more time on cross-fertilization activities and participate in more cross-disciplinary knowledge creation (Rhoten and Pfirman, 2007). Women at Utrecht University are consistently found to engage more in IDR collaboration (Van Rijnsoever and Hessels, 2011). In addition, having female team members is linked to increased interdisciplinarity of collaborative research outputs (Pinheiro et al., 2022; Specht and Crowston, 2022).
Cultural factors, such as gender stereotypes, the prevailing “masculine” culture in STEM disciplines, and a lack of role models, can hinder the participation of female scientists in IDR. Societal expectations often associate traits like kindness, warmth, and helpfulness with women, while men are perceived as more analytical, competitive, and independent (Carli et al., 2016). Cultural pressures impose gender roles on female scientists, reinforcing these stereotypes. Women frequently face stereotype threats in STEM disciplines due to long-standing sociocultural biases surrounding successful cisgender white males (Corbett and Hill, 2015). It is believed that women lack achievement-oriented attributes that are necessary for success in male-dominated occupations (Noe, 1988). In addition, female scientists encounter another well-documented gender stereotype that questions their competence in mathematics and science (Bench et al., 2015). STEM fields have traditionally exhibited a strongly masculine culture, creating significant pressure on female scientists to conform to masculine norms within the scientific community. This can result in feelings of lower belonging in the field for women scientists (Blackburn, 2017). These cultural factors, along with a lack of female role models in science, contribute to identity threats among female scientists, i.e., a concern that their perceived weaknesses are attributed to themselves and women as a whole. This can significantly impact their confidence in pursuing careers in science (Gaule and Piacentini, 2018; Hirshfield, 2010). Negative social experiences further compound their insecurities, leading them to adopt safer research strategies and participate less in IDR.
In addition, the systematic barriers female scientists experience in the current research system may prevent them from engaging in IDR. The Matilda effect, which refers to the tendency for women’s abilities and contributions to be underestimated and undervalued, is particularly relevant in academia and may further discourage women scientists from pursuing IDR (Merton, 1973; Rossiter, 1993). The current modes of scientific practices and rewards can place female scientists into unequally competitive positions, particularly in some traditional and well-established disciplines (Bird, 2001; Etzkowitz and Kemelgor, 2001). The barriers female scientists have in the scientific community, such as unequal opportunities and discriminatory policies, might make them more risk-averse in their work, which could be the reason why female scientists choose to specialize in comparatively “niche” domains where they can be recognized more easily (Bird, 2001). As a result, female scientists may be discouraged from pursuing IDR or other ventures that carry greater risks and uncertainties, especially in the early stages of their careers. Empirical studies have supported this notion. For example, a survey suggests that junior female scientists feel discouraged from engaging in interdisciplinary collaboration and face additional obstacles compared to their male colleagues (Smith-Doerr and Croissant, 2016).
As discussed in the “Early-career scientists and interdisciplinary research” sub-section, involvement in IDR presents scientists with both benefits and challenges. IDR promotes long-term career growth and acts as a vital catalyst for fostering innovative breakthroughs, whereas pursuing an IDR trajectory may result in temporary productivity obstacles and introduce uncertainties about securing academic positions. In conjunction with cultural factors and the systematic barriers experienced by female scientists within the research system, female scientists are likely less inclined to engage in IDR, relative to their male peers, thereby creating a gender bias in this particular direction. The existing literature on gender differences in IDR presents inconclusive results. Many relevant studies are based on survey data with small sample sizes, which limits the generalizability of their findings. As a result, it remains ambiguous whether there are true gender differences in IDR, particularly among ECSs.
Relationship between mentorship and students’ engagement in interdisciplinary research
Doctoral advisors hold significant influence as both the “gatekeepers to the scholarly profession” and “socializing agents of the discipline.” The terms advisor, mentor, or research supervisor are frequently used interchangeably, even though the responsibilities associated with these roles may overlap or have distinct aspects. Advisors’ role is pivotal in molding students’ selection of research topics and their thesis writing (Bu et al., 2022). Typically, there are two primary approaches for students to determine their thesis topic: either it is assigned by advisors or students independently find and choose a topic after consultation with their advisors (Lei, 2009). The selection of a thesis topic depends on students’ interests, their capacity to conduct and complete the research, its manageability, and the demonstration of their independent mastery of the subject matter (I’Anson and Smith, 2004; Isaac et al., 1989; Pemberton, 2012). The principles of independence and originality form the foundation of students’ thesis writing (Isaac et al., 1992), with advisors providing crucial guidance in defining the theoretical framework, research questions, research design, and addressing any challenges encountered during the thesis writing process (Bargar and Duncan, 1982). Existing literature emphasizes the importance of both students and their advisors in the selection of thesis topics (de Kleijn et al., 2013; de Kleijn et al., 2012). For instance, Lei (2009) suggests that the decision-making process for selecting a thesis topic is complex and involves critical factors and resources taken into account by students and their advisors. Recognizing the advisors’ impactful role in shaping students’ research choices, the concept of academic genealogy has been introduced to measure intellectual inheritance propagated across generations of scientists through academic mentoring provided by advisors to their students (Rossi et al., 2017; Sugimoto et al., 2011).
The majority of existing literature has primarily concentrated on exploring the influence of advisors on their students’ thesis topic selection or thesis writing, with only a limited number of exceptions investigating the correlation between mentorship and the level of interdisciplinarity in students’ doctoral theses. Specifically, Sugimoto et al. (2011) conducted a study focusing on 3038 Ph.D. theses in the field of Library and Information Science (LIS), revealing that doctoral theses supervised by advisors from non-LIS disciplines exhibited a higher degree of interdisciplinarity compared to those guided by advisors with an LIS background. Similarly, an examination of LIS doctoral theses in North America further affirms the significance of advisors’ disciplinary backgrounds in shaping the interdisciplinarity levels of students’ theses (Mongeon et al., 2016).
The gender of advisors plays a crucial role in influencing students’ involvement in IDR. Extensive literature highlights the significance of advisors’ gender in shaping mentoring experiences and outcomes. Female and male advisors tend to exhibit different communication styles due to their varying preferences regarding connection and intimacy for females, and status and independence for males (Tannen, 1991). Studies suggest that female advisors often provide more social, nurturing, caring, and supportive forms of assistance compared to their male counterparts, which can be attributed to gender contexts, hierarchies, and socialization patterns (Canary and Dindia, 2009). In addition, research demonstrates that male advisors typically offer instrumental and career support, while female advisors often provide more emotional support (Allen and Eby, 2004). These gender differences can influence both the nature of the mentoring relationship and students’ mentoring experiences, subsequently affecting their thesis topic selection and engagement in IDR.
The pairing of students and advisors based on their gender can potentially impact students’ engagement in IDR. Extensive discussions have focused on the benefits of same-gender mentoring relationships, such as improved communication, more frequent meetings, higher satisfaction, increased psychological support, and role-model effects (Aguinis et al., 2018; Gaule and Piacentini, 2018; Milkman et al., 2015; Robinson, 2011; Rossello and Cowan, 2019). Empirical evidence suggests that same-race mentoring relationships are often associated with higher student productivity (Canaan and Mouganie, 2023; Gaule and Piacentini, 2018; Pezzoni et al., 2016; Rossello et al., 2020), although there are some mixed findings with negative effects or no effect on students’ performance in same-gender mentorship (Blake‐Beard et al., 2011; Hilmer and Hilmer, 2007). However, previous studies have not thoroughly examined whether and how the gender combination of students and advisors influences students’ research strategies and thesis topic selection. Given the varying outcomes observed in terms of students’ productivity and success resulting from different types of student-advisor gender pairings, this study anticipates that the gender combination of students and advisors is related to the quality of mentorship, the nature of the mentoring relationship, and students’ research strategies, including their engagement in IDR.
Interdisciplinarity measures
To measure IDR, a variety of methods have been proposed from both qualitative and quantitative perspectives. Qualitative measures of IDR normally rely on self-assessment by participants and peer review processes. However, these approaches may be limited in scalability and objectivity. In recent years, with the proliferation of bibliographic data, quantitative measures of interdisciplinarity based on bibliometric approaches have been developed at different levels, ranging from institutions (Rafols et al., 2012), research teams (Specht and Crowston, 2022), journals (Zhang et al., 2016), and individuals to publications (Liu et al., 2022).
Citation analysis is the most widely used bibliometric tool for measuring IDR with a focus on knowledge exchange or integration across fields. Among the citation-based methods, the percentage of citations outside of the discipline of the citing publication is commonly used as an indicator of IDR (Rafols and Meyer, 2007). In addition, the occurrence of discipline-specific citations pointing to other disciplines serves as a manifestation of knowledge exchange or integration among fields.
When citation data is unavailable, citation-based methods cannot be applied to measure IDR. In such cases, a top-down approach is applied using a pre-determined subject classification scheme that categorizes publications based on cognitive attributes (Bordons et al., 2004). Bibliometric databases, such as WoS, Scopus and PQDT, assign each publication to one or more disciplines or subjects based on these attributes. Co-occurrence relations of subjects can then map the collaborative network at the subject level and quantify the interdisciplinary degree of different subjects (Hu and Zhang, 2018; Karlovčec and Mladenić, 2015; Morillo et al., 2003; Porter et al., 2007)
The current methodologies for measuring interdisciplinarity in scientific documents, such as journal articles (Wang et al., 2015), patents (Huang and Su, 2019; Leydesdorff, 2018), and research proposals (Huutoniemi et al., 2010), do not provide clear guidance on assessing interdisciplinarity within doctoral theses. Furthermore, it is uncertain if existing methods can effectively capture the integration of knowledge across scientific domains within this specific type of scientific document.
Research questions
Having examined the relevant literature, this study identifies several research gaps. First, despite extensive assessment of the relationship between interdisciplinarity and scientific impact, there remains a lack of clarity regarding the participation of ECSs in IDR over the past few decades. The research strategies adopted by ECSs need to align with evolving societal needs and be guided by science and technology policies. The promotion of IDR has been driven by societal challenges, resulting in the formulation and reinforcement of relevant policies over time (Ash, 2019). Consequently, we expect that ECSs will exhibit changes in their engagement with IDR over time, closely following the increasing policy emphasis and support for IDR.
Second, the level of ECSs’ engagement in IDR may vary across universities with differing levels of research intensity due to variations in the universities’ dedication to IDR. Previous studies have indicated the growing adoption of interdisciplinarity as an institutional strategy among research universities in recent decades (Brint, 2005). Leading universities typically demonstrate stronger commitments to IDR and are better equipped to effectively implement IDR strategies, which may not be true for other institutions (Feller, 2004). In addition, ECSs’ involvement in IDR might also be influenced by their disciplinary contexts, as the challenges and requirements for conducting IDR can differ across disciplines (Wagner et al., 2011).
However, prior literature does not provide comprehensive insights into the prevalence of IDR among ECSs from the perspectives of temporal changes, disciplinary disparities, and universities with varying research intensity. In light of these gaps, we pose the following research question:
RQ1: What is the prevalence of interdisciplinary research among early-career scientists? To address RQ1, our analysis focuses on examining the temporal evolution of the interdisciplinarity indicator and its distributions across five scientific domains, as well as universities with different levels of research intensity.
The existing literature has not extensively explored the impact of gender on ECSs’ involvement in IDR. The effect is determined by both the effects of IDR on ECSs’ career development and the preferences female scientists have toward IDR. Previous studies suggest that IDR positively affects scientists’ mid-term and long-term career advancement while potentially impacting productivity negatively in the short term. Considering the relative disadvantage of female ECSs in the research system and the cultural barriers as discussed in the sub-section of “Gender and interdisciplinary research”, it is expected that they may adopt safer research strategies, leading to a lower level of engagement in IDR compared to male peers.
The gender disparities in ECSs’ participation in IDR may have evolved over time. Research indicates an increasing prominence of gender differences in productivity and scientific impact over the past 60 years due to the worsening sustainability of female scientists’ careers in academia (Huang et al., 2020). Consequently, it can be inferred that gender differences in ECSs’ engagement in IDR might have strengthened over time.
Moreover, disciplinary variations can play a role in gender disparities concerning ECSs’ involvement in IDR due to gender differences in performance, academic positions, research grants, and leadership across different fields (Brouns, 2000; Higher Education Funding Council for England, 2006; Huang et al., 2020). Furthermore, the underrepresentation of female scientists is notably more pronounced in top-tier institutions where highly competitive faculties and discriminatory policies are prevalent (Buchmann, 2009; Nadis, 2001). Thus, it is expected that gender differences in ECSs’ engagement in IDR will be particularly significant within universities with high research intensity, relative to that within ordinary universities.
To address these considerations, the following research question is proposed with a focus on temporal changes, disciplinary disparities, and varying research intensity among universities:
RQ2: How does gender influence ECSs’ engagement in IDR?
The existing literature does not provide clear insights into the connection between advisors’ gender and ECSs’ involvement in IDR. Previous studies primarily focus on whether advisors’ gender influences their students’ research success or likelihood of remaining in academia (Canaan and Mouganie, 2023; Gaule and Piacentini, 2018; Pezzoni et al., 2016; Rossello et al., 2020), as well as the importance of advisors on students’ thesis topic selection (de Kleijn et al., 2013; de Kleijn et al., 2012). However, these investigations do not shed light on whether advisors’ gender influences students’ research strategies or topic selection, particularly about IDR. Taking into account the wide range of gender differences related to communication and mentoring styles (Allen and Eby, 2004; Canary and Dindia, 2009; Eagly and Crowley, 1986), it is plausible that female and male advisors may establish different types of mentoring relationships with students. These relationships could potentially impact students’ overall mentoring experiences, as well as their choice of thesis topics and their level of engagement in IDR.
In addition, the existing literature does not explore whether the gender pairing of students and advisors is associated with gender differences in ECSs’ engagement in IDR. Advisors are important in the socialization process of Ph.D. students, and in shaping their perception of their status in science and their belonging to academia (Sallee, 2011). Given that the five scientific domains we investigate are generally male-dominated, female students working under the guidance of female advisors may be more cognizant of the unequal opportunities and negative stereotypes about women that female scientists face (Gaule and Piacentini, 2018), which increases female students’ sense of insecurity in academia. Consequently, relative to being supervised by male advisors, being supervised by female advisors might result in reduced participation of female students in IDR considering the potentially negative impact of IDR on their short-term productivity. In parallel, this reduction in participation may not be observed for male students under supervision by female advisors or female students under supervision by male advisors. Hence, the gender pairing of students and advisors may influence the gender disparities observed in ECSs’ involvement in IDR. Specifically, mentorship by female advisors could potentially amplify the gender differences in students’ engagement with IDR.
To address these research gaps, we propose a third research question:
RQ3: Is the gender of advisors correlated with students’ participation in IDR?
To investigate this research question, our study examines whether the gender of advisors is correlated with students’ participation in IDR and explores whether the interaction between advisors’ gender and students’ gender contributes to gender disparities in this regard.
Data and methods
PQDT dataset
The data for this study is obtained from the Sciences and Engineering Collection of PQDT, which contains nearly all doctoral theses from U.S. universities in science and engineering since 1861. This study analyzes doctoral theses in behavioral sciences, biological sciences, engineering, health and medical sciences, and mathematical and physical sciences, which are representative scientific domains. PQDT provides information about each doctoral thesis including the author’s name, advisor’s name, abstract, title, year of publication, university, subject, and so forth. Until 1980, PQDT did not provide information on the names of advisors. Hence, in the analyses involving predicted gender based on advisors’ first names, we exclusively concentrate on records that contain available advisor names. We obtained 1,109,491 theses from the database up to the end of 2016. Supplementary Table S1 provides basic statistics of the original dataset.
PQDT provides information on secondary subjects for each thesis.Footnote 1 A thesis author selects the field(s) of research associated with their work from a list of subject categories when submitting their thesis to PQDT. The classification system for subject categories has been relatively stable in recent decades, with only minor updates. In 2000, the National Center for Education Statistics introduced a new subject category of “interdisciplinary” to the classification system through the Classification of Instructional Programs (CIP2000).Footnote 2 This addition resulted in the inclusion of “interdisciplinary” as a subject option in PQDT.
Within PQDT, information is available regarding the names of advisors and all members comprising the thesis committee. To measure mentorship, we define mentorship specifically as the mentoring relationship established between the student and their advisor. In this context, the members of the thesis committee are not considered in the measurement of mentorship since they do not provide direct training or guidance to the students. In cases where students have multiple advisors, our focus narrows down to the major advisor, who is typically mentioned first in the list of advisors.Footnote 3 This allows us to concentrate on the primary advisor’s role in the mentorship process.
We determine the major scientific domain for each thesis by using PQDT’s classification scheme of subject categories and mapping each subject to a broader scientific domain. For cases where a thesis in PQDT is assigned more than one subject, we define the major scientific domain to be the subject listed first by the author. If a thesis is assigned only one subject, the scientific domain the subject belongs to is defined as the major scientific domain of the thesis.
The distribution of major scientific domains of theses in the original dataset is shown in Supplementary Table S2. Out of a variety of scientific domains included in PQDT, we focus on doctoral theses in behavioral sciences, biological sciences, engineering, health and medical sciences, and mathematical and physical sciences, resulting in 920,619 doctoral theses. These five scientific domains are most representative of hard science research in this dataset, accounting for almost 83% of all theses present in PQDT. Based on the Carnegie Classification of Institutions of Higher Education,Footnote 4 we categorize U.S. universities in the final dataset into three categories: R1 university (university with very high research intensity), R2 university (university with high research intensity), and other types of university.
Variables operationalization
This study aims to investigate the evolution of IDR in doctoral theses over the past six decades. This study also examines whether there are gender differences between female and male students in the interdisciplinarity level of doctoral theses and whether advisors’ gender plays a significant role in shaping the interdisciplinarity of doctoral theses. The dependent variable is the average distance between subjects assigned to a thesis. The key independent variable is students’ gender, which is inferred by the Gender-Guesser package.
Predicting gender information of students and advisors
Given that PQDT does not contain gender information for authors and advisors, we use the Gender-Guesser package,Footnote 5 an open-source Python module, to predict the gender of authors and advisors based on their first names. This tool has been developed using data from the program of “gender” by Jorg Michael and involves manual checks by native speakers of various countries for higher accuracy (Santamaría and Mihaljević, 2018). Gender-Guesser is viewed as one of the most advanced tools for predicting gender from first names and has been widely used in relevant literature (Squazzoni et al., 2021; Zhang et al., 2022). This tool classifies gender into six categories based on the first names: “male”, “female”, “mostly male”, “mostly female”, “andy” (androgynous), and “unknown” (name not found). The predicted gender statistics for students and advisors are presented in Table 1. The categories “mostly male” and “mostly female” suggest that a given name may be used by both men and women, but it is more frequently used by one gender. The “andy” category indicates that a name is used equally by both men and women, while the “unknown” category refers to a name that does not exist within the gender dataset. To ensure high accuracy in gender prediction, we only retain observations with a definite predicted gender attribute (“male” or “female”) for students.Footnote 6 The final dataset included 675,135 doctoral theses authored by students from 747 U.S. universities.
Measuring the interdisciplinarity of doctoral theses
As discussed in the sub-section of “Interdisciplinarity measures”, two commonly used approaches in measuring the interdisciplinarity of scientific documents are citation-based methods and methods based on pre-determined subject classification schemes. However, for doctoral theses within PQDT, citation data is not available, making it impossible to calculate citation-based interdisciplinarity measures. Moreover, traditional citation indexes like WoS and Scopus often exclude doctoral theses (Vera-Baceta et al., 2019), rendering citation-based indicators unreliable for reflecting interdisciplinarity levels in theses. To address these challenges, we employ co-occurrence matrices of subjects assigned to doctoral theses in PQDT. This approach allows us to measure the degree of interdisciplinarity present in doctoral research, offering a viable alternative to overcome the limitations of citation-based methods.
Doctoral theses in PQDT are classified into 552 secondary subjects, which collectively encompass 22 broader subject categories. The distributions of the number of subjects for all doctoral theses, female-authored doctoral theses, and male-authored doctoral theses are displayed in Fig. 1a–c. The distribution of the number of subjects for doctoral theses by scientific domain is presented in Supplementary Fig. S1a.
We further construct a continuous variable to proxy the degree of interdisciplinarity in each thesis at a fine-grained level. Specifically, we calculate the average distance or dissimilarity between subjects assigned to each thesis based on a co-occurrence matrix of subjects that is constructed using the assigned subjects for all theses in PQDT. For example, if a thesis is assigned three subjects, electrical engineering, nanotechnology, and energy, the three subjects are considered to have co-occurred with each other once. The co-occurrence matrix of 552 subjects in PQDT allows calculation of the cosine distance, dij, between any two subjects, denoted by i and j, which is calculated based on Eq. (1).
where Ci,j is the cosine similarity between subject i and j based on the co-occurrence matrix.
Following previous literature (Kim et al., 2022; Rafols and Meyer, 2007), for any thesis that is classified into more than one subject, we use Eq. (2) to calculate the average distance, i.e., Distancet, between subjects assigned to thesis t:
where i and j indicate two subjects that are assigned to thesis t; di,j refers to the cosine distance between subject i and j, and is calculated by Eq. (1). N indicates the number of subjects assigned to thesis t (n > 1). The larger the Distancet, the more interdisciplinary the thesis is. Distancet is only available for theses that are assigned more than one subject. If a thesis is classified to only one subject, we define the value of Distancet as 0, which indicates that this thesis is not interdisciplinary at all. The distributions of the average distance between subjects for all doctoral theses, female-authored doctoral theses and male-authored doctoral theses are shown in Fig. 1d–f. The distribution of the average distance between subjects for doctoral theses by scientific domain is presented in Supplementary Fig. S1b. The statistics regarding the gender of students, and the average distance between subjects of theses across five scientific domains are shown in Table 2.
To address RQ1, we utilize descriptive analyses to examine the extent to which ECSs participated in IDR based on their doctoral theses. We investigate the temporal change of the interdisciplinarity indicator as well as its distributions across five scientific domains and universities with varying research intensity.
Regression analyses
To investigate RQs 2 and 3, we examine potential differences between female and male students in the level of involvement in IDR as reflected in their doctoral theses. We also explore the influence of advisors’ gender on shaping students’ engagement in IDR, as well as the possible relationship between the gender combination of students and advisors and the magnitude of gender differences observed in students’ participation in IDR. We use t to denote a thesis. Equation (3) is used to estimate the dependent variable, i.e., Distancet that refers to the average distance of subjects assigned to thesis t.
where female studentt is a dummy variable that reflects whether or not the student, i.e., the author of thesis t, is female. We introduce female advisort to Eq. (3) as an explanatory variable that is a dummy variable that reflects whether or not the advisor is female. Building upon the discussions in the “Relationship between mentorship and students’ engagement in interdisciplinary research” and “Research questions” sub-sections, which highlight the impact of the gender combination of advisors and students on students’ engagement in IDR, we introduce an interaction term between female studentt and female advisort into Eq. (3) to empirically test the validity of this assumption. Year fixed effects, i.e., Yt, are added to control time-variant unobserved changes, such as policy changes that support or discourage IDR; university fixed effects, i.e., Ut, are included to control possible effects of university characteristics on students’ preferences for IDR; fixed effects concerning scientific domains, i.e., Dt, are incorporated to control the major scientific domains in which students work. Equation (3) is estimated by an ordinary least squares (OLS) regression model. The average variance inflation factor (VIF) for all explanatory variables obtained is 2.03, which is significantly lower than the threshold value of 5. This suggests that there are no issues of multicollinearity present in the regression model.
Results
Evolution of interdisciplinary doctoral theses
In general, doctoral theses by U.S. Ph.D. students in the five scientific domains, have demonstrated a growing trend toward IDR. Doctoral theses that are assigned multiple subjects are considered interdisciplinary doctoral theses. Based on this definition, in period 1, i.e., before 2000, interdisciplinary doctoral theses only accounted for 32.1% of the total doctoral theses (Fig. 2a). This proportion increased to 59.5% during the second period, i.e., from 2000 to 2016, suggesting that IDR has become dominant in doctoral theses. The increasingly higher proportion of interdisciplinary doctoral theses is observed across scientific domains, and universities of varying research quality (Fig. 2b, d). From Supplementary Fig. S2, we find that since the 1990s, interdisciplinary studies have been dominant in doctoral theses.
The growing trend toward IDR in doctoral theses is also reflected by the increasing average distance between subjects assigned to doctoral theses (Fig. 2a), which is found across five scientific domains, and universities of different types (Fig. 2c, e). This trend suggests that, even within interdisciplinary doctoral theses, i.e., doctoral theses that are assigned multiple subjects based on our definition, research subjects with greater cognitive distance are being combined and integrated into doctoral research. Supplementary Fig. S2 displays the year-by-year temporal evolution of the proportion of IDR and the average distance between subjects.
To gain insights into the variations in the average distance between subjects, we conduct comparisons across different periods, scientific domains, and universities with varying research intensity. The findings confirm that there has been an increase in the average distance between subjects over time (Fig. 3a). Disciplinary disparities are also identified. Specifically, among the five scientific domains examined, mathematics and physical sciences showcase the lowest level of interdisciplinarity in doctoral theses (Fig. 3b). On the other hand, biological sciences, and health and medical sciences exhibit relatively higher levels of interdisciplinarity. Moreover, universities with high research intensity (R2 universities) demonstrated significantly higher levels of interdisciplinarity, compared to ordinary universities (Fig. 3c).
Gender disparities in interdisciplinary doctoral theses
Our analyses uncover the presence of gender disparities in the interdisciplinarity level of doctoral theses. The results of the Welch’s t test indicate a significant difference in the average distance between subjects in doctoral theses authored by female and male students, with female-authored theses having an average distance ~0.09 higher than that of male-authored theses (Supplementary Fig. S3a). When examining the historical changes, the average distance between subjects in doctoral theses by female and male students almost exhibits overlapping trends. Simple comparisons of the average distance between subjects between female- and male-authored theses suggest that female students combine and integrate research subjects with slightly greater cognitive distance than their male counterparts do. However, the simple comparisons fail to consider the potential effects caused by factors, such as universities, scientific domains, periods and advisors’ gender that may influence students’ preferences for IDR, as discussed in the section of “Literature review and research questions”. To address this issue, we use a multivariate regression model to estimate gender differences in the interdisciplinarity indicator while controlling for influential factors concerning the above aspects.
Gender discrepancies are evidenced by the greater average distance between subjects in doctoral theses by males. Overall, the results of the multivariable regression models indicate that the average distance between subjects in male-authored doctoral theses is significantly greater than that in female-authored theses by around 0.012 (P < 0.01), which is approximately 6% of the overall sample mean (Fig. 4a and column 1 of Table 3). This finding consistently holds across various periods (Fig. 4b, c and columns 3 and 5 of Table 3), universities of different research intensity (Fig. 5), and most scientific domains (Fig. 6). This observation suggests that the gender disparities in students’ engagement observed in doctoral theses are prevalent across periods, university quality, and scientific domains.
However, it should be noted that the magnitude of gender differences in the average distance between subjects varies across periods, university quality, and scientific domains. The gender disparity in the average distance between subjects is more pronounced during the second period (2000–2016) compared to the period spanning from 1950 to 1999 (Fig. 4b, c). This discovery implies an increase in gender disparities regarding the level of interdisciplinarity in doctoral theses over time. Besides, the average distance between subjects in male-authored doctoral theses is higher than in female-authored doctoral theses to a greater level in the top-tier universities (Fig. 5), i.e., R1 universities, relative to the remaining two types of universities. Additionally, disciplinary differences are found across scientific domains. Figure 6 indicates significant gender differences that exist in the interdisciplinarity indicator between female students and male students in the scientific domains including behavioral science, biological science, and engineering. However, in health and medical sciences, and mathematical science and physical sciences, the gender differences are not very significant.
The adjusted R2 values in the regression models are not high, especially for the regression analyses for observations from 2000 to 2016 (Table 3). In the regression model that focuses on the period of 1950–1999, the adjusted R2 is ~10%, suggesting that the model explains 10% of variations within the data. However, the adjusted R2 of the regression model that investigates the period of 2000–2016 only approaches around 5%. The relatively low R2 in the regression models suggest that besides the explanatory variables we use in this study, other important factors influence interdisciplinarity level in doctoral theses, such as students’ personalities, attitudes toward IDR, and so forth. Furthermore, we conduct residual analyses to evaluate the suitability of the regression models (Supplementary Fig. S4). Overall, the findings indicate that a linear regression model is appropriate for this study.
Furthermore, we find that the advisor’s gender is not significant in shaping gender disparities in the interdisciplinarity level of doctoral theses (columns 1, 3, and 5 in Table 3). Alternatively, students who are supervised by female advisors and those who are supervised by male advisors are not significantly distinctive concerning the average distance between subjects in their doctoral theses.
Our findings reveal a significant negative interaction term between female student and female advisor (Fig. 7a–c and columns 2 and 6 in Table 3), indicating that the gender combination of students and advisors influences the average distance between subjects covered in their doctoral theses. Specifically, the gender difference in the average distance between subjects is more pronounced when students are mentored by female advisors compared to male advisors. This suggests that being supervised by female advisors might strengthen the gender difference in students’ engagement in IDR.
Robustness check
We apply four methods to investigate the robustness of the major findings in this study. First, to minimize the potential bias in gender prediction, we apply Genderize.io, another commonly used gender inference tool (Sebo, 2021), to predict the gender of students and advisors, and perform the above analyses again. Genderize.io collects data from all over the Web, assigning gender to each given name based on the proportion of people with that name who are men or women. While Genderize.io exhibits high gender prediction accuracy, it does have certain limitations. These include a bias toward English names and potential unreliability concerning the sources utilized within the tool (Santamaría and Mihaljević, 2018). We retain names with a gender probability of 0.9 and above in the Genderize.io database. The predicted gender of students and advisors in five scientific domains in PQDT are shown in Supplementary Tables S3 and S4, respectively. The regression results are shown in Supplementary Table S5. Generally, we observe the consistent gender difference in the average distance between subjects in doctoral theses.
Furthermore, we incorporate doctoral thesis data where students’ gender is predicted as “mostly female” or “mostly male”. To maintain consistency, we reclassify instances labeled as “mostly female” to “female” and those labeled as “mostly male” to “male”. The regression results are shown in Supplementary Table S6, suggesting the persistent and lower interdisciplinarity of female-authored doctoral theses.
We further utilize the doc2vec model to assess the interdisciplinarity of doctoral theses based on their titles. Doc2vec is an unsupervised learning algorithm and an extension of the word2vec model (Le and Mikolov, 2014). It is used to vectorize input documents by capturing contextual information and preserving semantic relations. This tool is widely used because of its performance advantages over its alternatives, such as the TF-IDF models (Kim et al., 2017). Using the “gensim” library, we leverage the doc2vec model to represent each secondary subject in PQDT as vectors for comparing semantic distances between subjects. Following previous literature (Alasehir and Acarturk, 2022; Li et al., 2023; Whalen et al., 2020), we employ the PV-DM model with vector dimensions, learning rate, and window length set to 100, 0.025, and 3, respectively. We combine the titles of all doctoral theses assigned to each subject, resulting in a collection of 552 text blocks corresponding to 552 secondary subjects. These text blocks serve as input data for training the doc2vec model, which generates 100-dimensional vector representations for each secondary subject. These vectors can be viewed as points in a multidimensional semantic space, representing the content of each subject. To measure the distance between subjects, we calculate the cosine distance between each pair of subjects using the 100-dimensional vectors. Drawing from Eq. (2), we compute the average distance between subjects in the doctoral theses as a measure of interdisciplinarity and perform regression analyses using this metric. The regression results, as presented in Supplementary Table S7, indicate a persistent pattern of lower participation in IDR among female students.
Advisors’ characteristics, such as preferences for IDR, personalities, and mentoring styles may be related to students’ engagement in IDR. For robustness check, we examine whether the higher interdisciplinarity level of male-authored doctoral theses than that of female-authored doctoral theses still holds, even when female students and male students are supervised by the same advisor. We include advisor-fixed effects in Eq. (3) to account for advisor-specific and time-invariant characteristics. Upon accounting for the influence of advisors by adding advisor-fixed effects in regression models, Supplementary Table S8 indicates that male students consistently exhibit a higher level of interdisciplinarity in their theses, compared to female students, even when male and female students are supervised by the same advisor.
Discussion and conclusion
This study finds persistent and pervasive gender imbalance in interdisciplinary research based on doctoral theses by 675,135 U.S. Ph.D. students in five scientific domains from 1950 to 2016. We find that interdisciplinary doctoral theses have witnessed a growing trend across different scientific domains and universities of different research quality, which is in line with the rise of IDR and education. Since the 1990s, interdisciplinary studies have been dominant in doctoral theses. The average distance between subjects in doctoral theses is growing as well, which suggests that Ph.D. students are inclined to combine different types of disciplinary knowledge that are increasingly cognitively distant from each other. The finding suggests that Ph.D. students are becoming more and more adaptive to the growing complex social and economic problems, and get more engaged in IDR.
This study further finds not too substantial but persistent gender differences in ECSs’ engagement in IDR across the five hard science disciplines. This finding is evidenced by the longer average distance between subjects of male-authored doctoral theses, relative to that of female-authored doctoral theses. Specifically, the average distance between subjects of male-authored doctoral theses is significantly longer than that of doctoral theses by female students by 0.012, namely around 6% of the sample mean.
Our findings indicate that advisors’ gender is not significantly associated with the average distance between subjects in students’ doctoral theses. While we discussed the potential influence of advisors’ gender on students’ thesis topic selection in the “Relationship between mentorship and students’ engagement in interdisciplinary research” sub-section, it appears that advisors’ gender may not directly impact their students’ involvement in IDR. Other attributes and characteristics of advisors, such as their academic background and active participation in IDR, may be more influential in this regard. Further research should explore these aspects in greater detail.
Our analysis further confirms that the gender pairing of students and advisors magnifies the gender disparity in the level of interdisciplinarity observed in doctoral theses. This finding aligns with the hypothesis presented in the sub-section of “Research questions”. Specifically, when female students are mentored by female advisors, they tend to exhibit a heightened awareness of cultural factors and systemic barriers that present challenges for female scientists. As a result, these students may adopt more cautious and conservative research strategies to mitigate potential negative impacts on their career advancement that could arise from engaging in IDR. In contrast, no similar considerations appear to influence male students under the guidance of female advisors or students of any gender supervised by male advisors.
The findings mentioned above are in line with the earlier discussions regarding the systematic obstacles female students encounter when engaging in IDR, stemming from gender biases that are prevalent in current scientific practices and rewards. Our results suggest that female students exhibit less engagement in IDR and a lower degree of interdisciplinarity, relative to their male peers. This gender disparity displays a growing trend over time and is more prominent in top-tier universities and some scientific domains, such as biological science.
Considering the high potential of IDR, being less interdisciplinary during the early-career stages may pose a risk to female scientists’ future career progression, exacerbating gender disparities in science. Moreover, existing evidence suggests that female scientists and males differ in terms of selecting research topics and questions. For example, female-dominated teams tend to investigate research inquiries related to women’s health and medical needs (Koning et al., 2021). Being less interdisciplinary could hinder the generation of effective solutions to issues relevant to females and adversely impact their well-being. The results of this study have broad implications for university managers, funding agencies, and policymakers. To mitigate gender gaps in IDR for ECSs, addressing the systematic gender biases within current scientific practices and rewards is critical, particularly in terms of alleviating gender-based inequalities in the job market, promotion assessment, and funding allocation. Science policy and funding agencies should provide funding support for female scientists, particularly those at the early-career stages, who engage in IDR or work in interdisciplinary research centers, thus fostering an environment conducive to their participation in IDR. Furthermore, our findings suggest that being supervised by female advisors exacerbates the gender disparity in ECSs’ engagement in IDR. This implies that providing additional support to female faculty members may not only directly benefit their career development but also hold significant importance in fostering the growth and success of future generations of female scientists. It is important to acknowledge that the findings of this study are specific to the five hard science disciplines and may not apply to other domains, particularly humanities and social sciences.
This study has a few limitations. Informal mentorship plays a crucial role in shaping early-career scientists’ trajectories. Thus, whether or not the results of this study hold for informal mentoring relationships requires further investigation. Second, although we utilize large-scale data on doctoral theses, our analyses only investigate five scientific domains; it should be useful to validate our findings across other fields, particularly in humanities and social sciences. Furthermore, although doctoral theses reflect the culmination of students’ research training, and serve as a significant academic accomplishment, students may apply different research strategies in their doctoral theses, and other publications, such as journal articles or conference papers. Therefore, future studies should investigate whether the results of this study also apply to ECSs’ other publications. In addition, it is important to consider various other influential factors that shape ECSs’ preferences for IDR, including their personalities, advisors’ mentoring styles, and their attitudes toward IDR. These factors warrant further investigations in future studies. In this study, we employ first names as a proxy for inferring the gender of students and advisors. However, it is important to note that first names only reflect the gender assigned at birth and may not accurately capture their social gender. To overcome this limitation, conducting survey research could serve as a valuable complement, enabling a more comprehensive measurement of the social gender of both students and advisors.
Data availability
The main dataset we adopted is ProQuest which contains doctoral dissertation bibliographic data. ProQuest is under subscription under the authors’ affiliations. Despite this, per the requirement of Clarivate, the raw dataset cannot be spread to the public. Data are available from the authors upon reasonable request.
Notes
Subject categories are typically chosen when students submit their theses for publishing. In most cases, each student submits their thesis by themselves, but there are instances where universities may also submit theses on behalf of the students. To facilitate and streamline this process, PQDT offers the ETD (Electronic Theses and Dissertations) administrator system, which helps manage and optimize the electronic submission of dissertations and theses. If a degree-granting university can accept submissions through the ETD system, students at that university can submit their thesis for publishing through that particular submission site. Alternatively, if a university does not have an ETD site, students can utilize the ProQuest Direct publishing ETD site for their submissions. It is essential to note that the ProQuest subject category list aligns with their internal system and is not primarily determined by institutional libraries.
Please refer to CIP2000 from the link to https://nces.ed.gov/pubs2002/cip2000/.
Out of all the records where complete advisor information is available from ProQuest, less than 4% of students are found to have multiple advisors. This implies that in the majority of cases, ProQuest maintains only the main advisor. Therefore, excluding other advisors from the dataset, will not introduce any biases into our results.
We have not included non-binary categories of gender in our analysis because the gender inferring tool we utilize does not provide predictions for non-binary gender categories.
References
Aguinis H, Ji YH, Joo H (2018) Gender productivity gap among star performers in STEM and other scientific fields. J Appl Psychol 103(12):1283
Alasehir O, Acarturk C (2022) Interdisciplinarity in cognitive science: a document similarity analysis. Cogn Sci 46(12):e13222
Allen TD, Eby LT (2004) Factors related to mentor reports of mentoring functions provided: gender and relational characteristics. Sex Roles 50:129–139
Ash MG (2019) Interdisciplinarity in historical perspective. Perspect Sci 27(4):619–642
Bargar RR, Duncan JK (1982) Cultivating creative endeavor in doctoral research. J High Educ 53(1):1–31
Bench SW, Lench HC, Liew J, Miner K, Flores SA (2015) Gender gaps in overestimation of math performance. Sex Roles 72:536–546
Bird E (2001) Disciplining the interdisciplinary: radicalism and the academic curriculum. Br J Sociol Educ 22(4):463–478
Blackburn H (2017) The status of women in STEM in higher education: a review of the literature 2007–2017. Sci Technol Libraries 36(3):235–273
Blackmore P, Kandiko CB (2011) Interdisciplinarity within an academic career. Res Post‐Compuls Educ 16(1):123–134
Blake‐Beard S, Bayne ML, Crosby FJ, Muller CB (2011) Matching by race and gender in mentoring relationships: keeping our eyes on the prize. J Soc Issues 67(3):622–643
Bordons M, Morillo F, Gómez I (2004) Analysis of cross-disciplinary research through bibliometric tools. In Moed HF, Glänzel W, Schmoch U (eds) Handbook of quantitative science and technology research: the use of publication and patent statistics in studies of S&T systems. Springer, p 437–456
Brint S (2005) Creating the future:‘New directions’ in American research universities. Minerva 43(1):23–50
Brouns M (2000) The gendered nature of assessment procedures in scientific research funding: the Dutch case. High Educ Eur 25(2):193–199
Bu Y, Li H, Wei C, Liu M, Li J (2022) On the relationship between supervisor–supervisee gender difference and scientific impact of doctoral dissertations: evidence from Humanities and Social Sciences in China. J Inf Sci 48(4):492–502
Buchmann C (2009) Gender inequalities in the transition to college. Teach Coll Rec 111(10):2320–2346
Canaan S, Mouganie P (2023) The impact of advisor gender on female students’ STEM enrollment and persistence. J Hum Resour 58(2):593–632
Canary DJ, Dindia K (2009) Sex differences and similarities in communication. Taylor & Francis
Carli LL, Alawa L, Lee Y, Zhao B, Kim E (2016) Stereotypes about gender and science: women≠ scientists. Psychol Women Q 40(2):244–260
Charness G, Gneezy U (2012) Strong evidence for gender differences in risk taking. J Econ Behav Organ 83(1):50–58
Chen S, Arsenault C, Larivière V (2015) Are top-cited papers more interdisciplinary? J Informetr 9(4):1034–1046
Corbett C, Hill C (2015) Solving the equation: the variables for women’s success in engineering and computing. ERIC
Correll SJ (1997) Talking about leaving: why undergraduates leave the sciences. Contemp Sociol 26(5):644
D’Este P, Robinson-García N (2023) Interdisciplinary research and the societal visibility of science: the advantages of spanning multiple and distant scientific fields. Res Policy 52(2):104609. https://doi.org/10.1016/j.respol.2022.104609
Donner P (2021) Citation analysis of Ph.D. theses with data from Scopus and Google Books. Scientometrics 126(12):9431–9456. https://doi.org/10.1007/s11192-021-04173-w
Eagly AH, Crowley M (1986) Gender and helping behavior: a meta-analytic review of the social psychological literature. Psychol Bull 100(3):283
Eagly AH, Nater C, Miller DI, Kaufmann M, Sczesny S (2020) Gender stereotypes have changed: a cross-temporal meta-analysis of US public opinion polls from 1946 to 2018. Am Psychol 75(3):301
Eagly AH (2013) Sex differences in social behavior: a social-role interpretation. Psychology Press
Eckel CC, Grossman PJ (2008) Men, women and risk aversion: experimental evidence. Handb Exp Econ results 1:1061–1073
Etzkowitz H, Kemelgor C (2001) Introduction: gender inequality in science: a universal condition? Minerva 39(2):153–174
Etzkowitz H, Kemelgor C, Uzzi B (2000) Athena unbound: the advancement of women in science and technology. Cambridge University Press
Feller I (2004) Whither interdisciplinarity (in an era of strategic planning). Paper presented at the Annual Meeting of the American Association for the Advancement of Science, Seattle, WA
Fleming L (2001) Recombinant uncertainty in technological search. Manag Sci 47(1):117–132
Fox MF (2005) Gender, family characteristics, and publication productivity among scientists. Soc Stud Sci 35(1):131–150
Frodeman R, Klein JT, Pacheco RCDS (2017) The Oxford handbook of interdisciplinarity. Oxford University Press
Gaule P, Piacentini M (2018) An advisor like me? Advisor gender and post-graduate careers in science. Res Policy 47(4):805–813
Haier RJ, Jung RE, Yeo RA, Head K, Alkire MT (2005) The neuroanatomy of general intelligence: sex matters. NeuroImage 25(1):320–327
Harding S (1992) After the Neutrality Ideal: science, Politics, and “Strong Objectivity”. Soc Res 59(3):567–587
Harding SG (1986) The science question in feminism. Cornell University Press
Heiberger RH, Munoz-Najar Galvez S, McFarland DA (2021) Facets of specialization and its relation to career success: an analysis of US Sociology, 1980 to 2015. Am Sociol Rev 86(6):1164–1192
Higher Education Funding Council for England (2006) The higher education workforce in England: a framework for the future. HEFCE. Retrieved 03 Dec 2023 from http://hdl.voced.edu.au/10707/90731
Hilmer C, Hilmer M (2007) Women helping women, men helping women? Same-gender mentoring, initial job placements, and early career publishing success for economics PhDs. Am Econ Rev 97(2):422–426
Hirshfield LE (2010) "She won't make me feel dumb": identity threat in a male-dominated discipline. Int J Gender Sci Technol 2(1):5–24
Hsu G, Hannan MT, Koçak Ö (2009) Multiple category memberships in markets: an integrative theory and two empirical tests. Am Sociol Rev 74(1):150–169
Hu J, Zhang Y (2018) Measuring the interdisciplinarity of Big Data research: a longitudinal study. Online Inf Rev 42(5):681–696
Huang H-C, Su H-N (2019) The innovative fulcrums of technological interdisciplinarity: an analysis of technology fields in patents. Technovation 84:59–70
Huang J, Gates AJ, Sinatra R, Barabási A-L (2020) Historical comparison of gender inequality in scientific careers across countries and disciplines. Proc Natl Acad Sci USA 117(9):4609–4616
Huutoniemi K, Klein JT, Bruun H, Hukkinen J (2010) Analyzing interdisciplinarity: typology and indicators. Res Policy 39(1):79–88
I’Anson RA, Smith KA (2004) Undergraduate research projects and dissertations: issues of topic selection, access and data collection amongst tourism management students. J Hospitality Leis Sport Tour Educ 3(1):19–32
Isaac PD, Quinlan SV, Walker MM (1992) Faculty perceptions of the doctoral dissertation. J High Educ 63(3):241–268
Isaac PD, Koenigsknecht RA, Malaney GD, Karras JE (1989) Factors related to doctoral dissertation topic selection. Res High Educ 30:357–373
Jacobs JA, Frickel S (2009) Interdisciplinarity: a critical assessment. Annu Rev Sociol 35:43–65
James Jacob W (2015) Interdisciplinary trends in higher education. Palgrave Commun 1(1):1–5
Karlovčec M, Mladenić D (2015) Interdisciplinarity of scientific fields and its evolution based on graph of project collaboration and co-authoring. Scientometrics 102:433–454
Kim H, Park H, Song M (2022) Developing a topic-driven method for interdisciplinarity analysis. J Informetr 16(2):101255
Kim HK, Kim H, Cho S (2017) Bag-of-concepts: comprehending document representation through clustering words in distributed representation. Neurocomputing 266:336–352
de Kleijn RAM, Mainhard MT, Meijer PC, Pilot A, Brekelmans M (2012) Master’s thesis supervision: relations between perceptions of the supervisor–student relationship, final grade, perceived supervisor contribution to learning and student satisfaction. Stud High Educ 37(8):925–939. https://doi.org/10.1080/03075079.2011.556717
de Kleijn RAM, Mainhard MT, Meijer PC, Brekelmans M, Pilot A (2013) Master’s thesis projects: student perceptions of supervisor feedback. Assess Eval High Educ 38:1012–1026
Koning R, Samila S, Ferguson J-P (2021) Who do we invent for? Patents by women focus more on women’s health, but few women get to invent. Science 372(6548):1345–1348
Kovacs B, Lex R (2012) Contrasting alternative explanations for the negative effects of category spanning. Paper presented at the Academy of Management Proceedings, Boston, USA, August 3-7
Larivière V, Gingras Y (2010) On the relationship between interdisciplinarity and scientific impact. J Am Soc Inf Sci Technol 61(1):126–131
Larivière V, Haustein S, Börner K (2015) Long-distance interdisciplinarity leads to higher scientific impact. PLoS ONE 10(3):e0122565
Larivière V, Vignola-Gagné E, Villeneuve C, Gélinas P, Gingras Y (2011) Sex differences in research funding, productivity and impact: an analysis of Québec university professors. Scientometrics 87(3):483–498
Le Q, Mikolov T (2014) Distributed representations of sentences and documents. Paper presented at the International conference on machine learning, Beijing, China, June 21–26
Leahey E, Beckman CM, Stanko TL (2017) Prominent but less productive: the impact of interdisciplinarity on scientists’ research. Adm Sci Q 62(1):105–139
Lee DH (2019) Predicting the research performance of early career scientists. Scientometrics 121(3):1481–1504
Lei SA (2009) Strategies for finding and selecting an ideal thesis or dissertation topic: a review of literature. Coll Stud J 43:1324–1332
Levitt JM, Thelwall M (2008) Is multidisciplinary research more highly cited? A macrolevel study. J Am Soc Inf Sci Technol 59(12):1973–1984. https://doi.org/10.1002/asi.20914
Leydesdorff L (2018) Diversity and interdisciplinarity: how can one distinguish and recombine disparity, variety, and balance? Scientometrics 116:2113–2121
Li X, Tang X, Lu W (2023) Tracking biomedical articles along the translational continuum: a measure based on biomedical knowledge representation. Scientometrics 128(2):1295–1319
Liu M, Shi D, Li J (2017) Double-edged sword of interdisciplinary knowledge flow from hard sciences to humanities and social sciences: evidence from China. PLoS ONE 12(9):e0184977
Liu M, Hu X, Li J (2018) Knowledge flow in China’s humanities and social sciences. Qual Quant 52:607–626
Liu M, Bu Y, Chen C, Xu J, Li D, Leng Y, Sung M (2022) Pandemics are catalysts of scientific novelty: evidence from COVID-19. J Assoc Inf Sci Technol 73(8):1065–1078
Liu M, Zhang N, Hu X, Jaiswal A, Xu J, Chen H, Bu Y (2022). Further divided gender gaps in research productivity and collaboration during the COVID-19 pandemic: evidence from coronavirus-related literature. J Informetrics, Cologne, Germany 16(2). https://doi.org/10.1016/j.joi.2022.101295
Liu S, Chen H, Bu Y (2022) Comparing different perspectives of characterizing interdisciplinarity of scientific publications: author vs. publication perspectives. Paper presented at the Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries, Cologne, Germany, June 20–24
Lyall C, Bruce A, Marsden W, Meagher L (2013) The role of funding agencies in creating interdisciplinary knowledge. Sci Public Policy 40(1):62–71
de Macedo LSV, Jacobi PR, de Oliveira JAP (2023) Paradiplomacy of cities in the global south and multilevel climate governance: evidence from Brazil. Glob Public Policy Gov 3(1):86–115. https://doi.org/10.1007/s43508-023-00060-7
Merton RK (1973) The sociology of science: theoretical and empirical investigations. University of Chicago Press
Milkman KL, Akinola M, Chugh D (2015) What happens before? A field experiment exploring how pay and representation differentially shape bias on the pathway into organizations. J Appl Psychol 100(6):1678
Mongeon P, Shu F, Larivière V, Julien C-A, Piper A (2016) On the evolution of library and information science doctoral dissertation topics in North America (1960-2013). J Educ Libr Inf Sci 57:131–142
Morillo F, Bordons M, Gómez I (2003) Interdisciplinarity in science: a tentative typology of disciplines and research areas. J Am Soc Inf Sci Technol 54(13):1237–1249
Nadis S (2001) Top research universities face up to gender bias. Nature 409(6821):653–654
National Academy of Sciences (2004) Facilitating interdisciplinary research. National Academies Press
Nicholas D, Herman E, Xu J, Boukacem-Zeghmouri C, Abdullah A, Watkinson A, Rodríguez-Bravo B (2018) Early career researchers’ quest for reputation in the digital age. J Sch Publ 49(4):375–396
Noe R (1988) Women and mentoring: a review and research agenda. The Academy of Management Review, 13. https://doi.org/10.2307/258355
Okamura K (2019) Interdisciplinarity revisited: evidence for research impact and dynamism. Palgrave Commun 5(1):1–9
Pemberton CLA (2012) A “How-to” guide for the education thesis/dissertation process. Kappa Delta Pi Rec 48(2):82–86
Pezzoni M, Mairesse J, Stephan P, Lane J (2016) Gender and the publication output of graduate students: a case study. PLoS ONE 11(1):e0145146
Pinheiro H, Durning M, Campbell D (2022) Do women undertake interdisciplinary research more than men, and do self-citations bias observed differences? Quant Sci Stud 3(2):363–392
Porter A, Cohen A, David Roessner J, Perreault M (2007) Measuring researcher interdisciplinarity. Scientometrics 72(1):117–147
Puppim de Oliveira JA, Qian H (2023) Perspectives in global environmental governance. Glob Public Policy Gov 3(1):5–11
Rafols I, Meyer M (2007) How cross-disciplinary is bionanotechnology? Explorations in the specialty of molecular motors. Scientometrics 70:633–650
Rafols I, Leydesdorff L, O’Hare A, Nightingale P, Stirling A (2012) How journal rankings can suppress interdisciplinary research: A comparison between innovation studies and business & management. Res Policy 41(7):1262–1282
Rhoten D, Pfirman S (2007) Women in interdisciplinary science: exploring preferences and consequences. Res Policy 36(1):56–75
Rier DA (2003) Gender, lifecourse and publication decisions in toxic-exposure epidemiology: now!‘versuswait a minute!’. Soc Stud Sci 33(2):269–300
Van Rijnsoever FJ, Hessels LK (2011) Factors associated with disciplinary and interdisciplinary research collaboration. Res Policy 40(3):463–472
Robinson CV (2011) In pursuit of female chemists. Nature 476(7360):273–275. https://doi.org/10.1038/476273a
Rossello G, Cowan R (2019) Far from random? the role of homophily in student supervision. Working paper, Maastricht Economic and social Research institute on Innovation and Technology
Rossello G, Cowan R, Mairesse J (2020). Ph. D. research output in STEM: the role of gender and race in supervision. Working paper, Maastricht Economic and social Research institute on Innovation and Technology
Rossi L, Freire IL, Mena-Chalco JP (2017) Genealogical index: a metric to analyze advisor–advisee relationships. J Informetr 11(2):564–582
Rossiter MW (1993) The Matthew Matilda effect in science. Soc Stud Sci 23(2):325–341
Rylance R (2015) Global funders to focus on interdisciplinarity [Commentary]. Nature 525(7569):313–315
Sá C, Cowley S, Martinez M, Kachynska N, Sabzalieva E (2020) Gender gaps in research productivity and recognition among elite scientists in the US, Canada, and South Africa. PLoS ONE 15(10):e0240903
Sallee MW (2011) Performing masculinity: considering gender in doctoral student socialization. J High Educ 82(2):187–216
Santamaría L, Mihaljević H (2018) Comparison and benchmark of name-to-gender inference services. PeerJ Computer Sci 4:e156
Sarrico CS (2022) The expansion of doctoral education and the changing nature and purpose of the doctorate. High Educ 84(6):1299–1315
Schilling MA, Green E (2011) Recombinant search and breakthrough idea generation: an analysis of high impact papers in the social sciences. Res Policy 40(10):1321–1331
Sebo P (2021) Performance of gender detection tools: a comparative study of name-to-gender inference services. J Med Libr Assoc JMLA 109(3):414
Sentell T, Vamos S, Okan O (2020) Interdisciplinary perspectives on health literacy research around the world: more important than ever in a time of COVID-19. Int J Environ Res Public Health 17(9):3010
Smith-Doerr L, Croissant J (2016) Gender equity and interdisciplinary collaboration. Interdisciplinarity now. The Social Science Research Council
Specht A, Crowston K (2022) Interdisciplinary collaboration from diverse science teams can produce significant outcomes. PLoS ONE 17(11):e0278043
Spelt EJ, Biemans HJ, Tobi H, Luning PA, Mulder M (2009) Teaching and learning in interdisciplinary higher education: a systematic review. Educ Psychol Rev 21:365–378
Squazzoni F, Bravo G, Farjam M, Marusic A, Mehmani B, Willis M, Grimaldo F (2021) Peer review and gender bias: a study on 145 scholarly journals. Sci Adv 7(2):eabd0299
Sugimoto CR, Ni C, Russell TG, Bychowski B (2011) Academic genealogy as an indicator of interdisciplinarity: an examination of dissertation networks in library and information science. J Am Soc Inf Sci Technol 62(9):1808–1828. https://doi.org/10.1002/asi.21568
Tannen D (1991) You just don’t understand: Women and men in conversation. Virago, London
Unger S, Erhard L, Wieczorek O, Koß C, Riebling J, Heiberger RH (2022) Benefits and detriments of interdisciplinarity on early career scientists’ performance. An author-level approach for US physicists and psychologists. PLoS ONE 17(6):e0269991
Vera-Baceta M-A, Thelwall M, Kousha K (2019) Web of Science and Scopus language coverage. Scientometrics 121(3):1803–1813
Wagner CS, Roessner JD, Bobb K, Klein JT, Boyack KW, Keyton J, Börner K (2011) Approaches to understanding and measuring interdisciplinary scientific research (IDR): a review of the literature. J Informetr 5(1):14–26
Wang J, Thijs B, Glänzel W (2015) Interdisciplinarity and impact: distinct effects of variety, balance, and disparity. PLoS ONE 10(5):e0127298
Weitzman ML (1998) Recombinant growth. Q J Econ 113(2):331–360
Whalen R, Lungeanu A, DeChurch L, Contractor N (2020) Patent similarity data and innovation metrics. J Empir Leg Stud 17(3):615–639
White K (2019) Publications output: US trends and international comparisons. Science & Engineering Indicators 2020. NSB-2020-6. National Science Foundation
Zhang L, Rousseau R, Glänzel W (2016) Diversity of references as an indicator of the interdisciplinarity of journals: taking similarity between subject fields into account. J Assoc Inf Sci Technol 67(5):1257–1265
Zhang L, Sun B, Jiang L, Huang Y (2021) On the relationship between interdisciplinarity and impact: distinct effects on academic and broader impact. Res Eval 30(3):256–268. https://doi.org/10.1093/reseval/rvab007
Zhang L, Shang Y, Huang Y, Sivertsen G (2022) Gender differences among active reviewers: an investigation based on Publons. Scientometrics 127(1):145–179
Zhang Y, Yu Q (2020) What is the best article publishing strategy for early career scientists? Scientometrics 122:397–408
Acknowledgements
This study is sponsored by the National Natural Science Foundation of China (72104054, 72104007, 72234001) and the Shanghai Pujiang Talent program (21PJC026).
Author information
Authors and Affiliations
Contributions
Conceptualization: ML and YB; methodology: ML, NZ, and SY; formal analysis: ML; writing (original draft preparation): ML; writing (review and editing): ML, YB, and NZ; visualization: ML; supervision: ML, YB, and NZ. All authors have read and agreed to the published version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Informed consent
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Liu, M., Yang, S., Bu, Y. et al. Female early-career scientists have conducted less interdisciplinary research in the past six decades: evidence from doctoral theses. Humanit Soc Sci Commun 10, 918 (2023). https://doi.org/10.1057/s41599-023-02392-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1057/s41599-023-02392-5