<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD with MathML3 v1.1d2 20140930//EN" "JATS-journalpublishing1-mathml3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="1.1d2" xml:lang="en">
  <front>
    <journal-meta>
      <journal-id journal-id-type="nlm-ta">TETAI</journal-id>
      <journal-id journal-id-type="publisher-id">ICCK</journal-id>
      <journal-title-group>
        <journal-title>ICCK Transactions on Emerging Topics in Artificial Intelligence</journal-title>
      </journal-title-group>
      <issn pub-type="ppub" publication-format="print"/>
      <issn pub-type="epub" publication-format="electronic">3068-6652</issn>
      <publisher>
        <publisher-name>Institute of Central Computation and Knowledge Inc</publisher-name>
        <publisher-loc>522 W RIVERSIDE AVE STE N, SPOKANE, WA, 99201, UNITED STATES</publisher-loc>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.62762/TETAI.2025.416797</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Review Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Advances in Artificial Intelligence-Based Depression Diagnosis: A Systematic Review</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="orcid">https://orcid.org/0000-0001-5364-3550</contrib-id>
          <name>
            <surname>Wang</surname>
            <given-names>Liangguo</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Wu</surname>
            <given-names>Jiaqian</given-names>
          </name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff1"><label>1</label>Xinjiang Hetian College, Xinjiang 848000, China</aff>
        <aff id="aff2"><label>2</label>Beijing Anding Hospital, Capital Medical University, Beijing 100088, China</aff>
      </contrib-group>
      <author-notes>
        <corresp id="cor2">Corresponding Author: Jiaqian Wu. Email: <email>1310908555@qq.com</email></corresp>
      </author-notes>
      <pub-date date-type="pub" pub-type="epub" publication-format="online">
        <day>27</day>
        <month>8</month>
        <year>2025</year>
      </pub-date>
      <volume>2</volume>
      <issue>3</issue>
      <fpage>148</fpage>
      <lpage>156</lpage>
      <history>
        <date date-type="received">
          <day>22</day>
          <month>5</month>
          <year>2025</year>
        </date>
        <date date-type="accepted">
          <day>18</day>
          <month>7</month>
          <year>2025</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>© 2025 by the Authors. Published by Institute of Central Computation and Knowledge. This is an open access article under the CC BY license (https://creativecommons.org/licenses/by/4.0/).</copyright-statement>
        <copyright-year>2025</copyright-year>
        <copyright-holder>The Authors</copyright-holder>
        <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
        <license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
        </license>
      </permissions>
      <self-uri xlink:href="https://www.icck.org/article/abs/tetai.2025.416797">This article is available from https://www.icck.org/article/abs/tetai.2025.416797</self-uri>
      <abstract>
        <p>This study systematically reviews the current status and recent advances in intelligent depression detection, aiming to provide insights for applying artificial intelligence in mental health. Using a systematic review approach, we analyze detection methods based on multiple data types including voice, facial expressions, body signals, and social media texts, while examining how algorithms have evolved from traditional machine learning to deep learning. Results show that AI technology has clear benefits in improving detection accuracy, reducing costs, and enabling early warning systems. Current research still faces important challenges in data collection, technical reliability, clinical use, and privacy concerns. Future work should focus on combining knowledge from different fields, implementing systems in clinical practice, and developing standards for wider adoption.</p>
      </abstract>
      <kwd-group kwd-group-type="author" xml:lang="en">
        <kwd>depression</kwd>
        <kwd>artificial intelligence</kwd>
        <kwd>multimodal features</kwd>
        <kwd>deep learning</kwd>
        <kwd>intelligent detection</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="S1">
      <label>1.</label>
      <title>Introduction</title>
      <p id="S1.p1">Depression represents one of the most prevalent mental health disorders worldwide, with far-reaching implications for global public health. In recent years, artificial intelligence (AI) has emerged as a promising approach for enhancing depression identification and diagnosis. Modern AI-based detection systems analyze multiple types of data simultaneously, examining vocal characteristics, facial expressions, and physiological measurements to create more objective diagnostic tools [<xref rid="ref001" ref-type="bibr">1</xref>, <xref rid="ref002" ref-type="bibr">2</xref>].</p>
      <p id="S1.p2">The global impact of depression is substantial and growing. According to the World Health Organization (WHO), approximately 350 million people suffer from depression globally, with incidence rates showing a consistent increase [<xref rid="ref003" ref-type="bibr">3</xref>]. This situation worsened during the COVID-19 pandemic, which triggered a 25% rise in depression cases worldwide during 2020 [<xref rid="ref004" ref-type="bibr">4</xref>]. In China specifically, studies report a lifetime prevalence rate of 6.8%, leading to significant social and economic burdens [<xref rid="ref005" ref-type="bibr">5</xref>]. Current global health assessments consistently rank depression among the leading contributors to worldwide disease burden [<xref rid="ref006" ref-type="bibr">6</xref>].</p>
      <p id="S1.p3">Traditional depression diagnosis relies heavily on clinical judgment and standardized questionnaires. While these approaches remain essential, they face important limitations: (1) inconsistent diagnoses between different clinicians, (2) lengthy assessment procedures, and (3) difficulties implementing these methods at scale, especially in settings with limited resources [<xref rid="ref007" ref-type="bibr">7</xref>]. AI technologies help address these challenges by reducing subjective elements in diagnosis and improving overall efficiency [<xref rid="ref008" ref-type="bibr">8</xref>].</p>
      <p id="S1.p4">The application of AI in depression diagnosis has led to notable improvements in both early detection capabilities and diagnostic accuracy [<xref rid="ref009" ref-type="bibr">9</xref>]. Recent technological progress has enabled significant advances in depression detection through the analysis of multiple data types [<xref rid="ref010" ref-type="bibr">10</xref>, <xref rid="ref011" ref-type="bibr">11</xref>], including:</p>
      <p>
        <list list-type="bullet" id="S1.I1">
          <list-item id="S1.I1.i1">
            <p id="S1.I1.i1.p1">Voice analysis technologies that identify depression markers in speech</p>
          </list-item>
          <list-item id="S1.I1.i2">
            <p id="S1.I1.i2.p1">Systems that automatically detect depression indicators in facial expressions</p>
          </list-item>
          <list-item id="S1.I1.i3">
            <p id="S1.I1.i3.p1">Methods for processing biological and physiological signals</p>
          </list-item>
          <list-item id="S1.I1.i4">
            <p id="S1.I1.i4.p1">Techniques that analyze social media content for signs of depression</p>
          </list-item>
        </list>
      </p>
      <p>This systematic review synthesizes current research findings and develops a structured framework for effectively implementing these intelligent detection technologies in real-world clinical settings.</p>
    </sec>
    <sec id="S2">
      <label>2.</label>
      <title>Multimodal Feature Analysis of Intelligent Depression Detection</title>
      <p id="S2.p1">The clinical manifestations of depression are complex and diverse, including changes in multiple dimensions such as emotions, cognition, behavior, and physiology. Based on artificial intelligence, depression detection technology can capture the characteristic manifestations of depression from different perspectives by analyzing multimodal data such as patients' voice, facial expressions, physiological signals, and social media behaviors. Studies have shown that the comprehensive analysis of multimodal features can provide more comprehensive and reliable detection results [<xref rid="ref016" ref-type="bibr">16</xref>].</p>
      <sec id="S2.SS1">
        <label>2.1</label>
        <title>Voice Feature Analysis</title>
        <p id="S2.SS1.p1">Voice feature analysis is one of the earliest and most mature research directions in intelligent depression detection. Cummins et al. [<xref rid="ref019" ref-type="bibr">19</xref>] systematically summarized the application of voice analysis in depression assessment and found that the voice of depressed patients has obvious changes in prosodic features, including a decrease in fundamental frequency (F0), a weakening of voice energy, and a slowdown in speech rate. These features can be objectively quantified through digital signal processing technology. Low et al. [<xref rid="ref010" ref-type="bibr">10</xref>] demonstrated how to use natural language processing technology to analyze voice features and verified that these features have a significant correlation with the severity of depressive symptoms. In addition to basic acoustic features, more complex features are also included.</p>
      </sec>
      <sec id="S2.SS2">
        <label>2.2</label>
        <title>Facial Expression Analysis</title>
        <p id="S2.SS2.p1">Facial expressions are important external manifestations of emotional states. Depressed patients often exhibit reduced facial expressions and dull emotional expressions. The facial expression analysis system developed by Ringeval et al. [<xref rid="ref012" ref-type="bibr">12</xref>] can track the movement of key facial feature points in real-time. By analyzing the spatio-temporal change patterns of these feature points, the facial expression features of depressed patients can be effectively identified. The system proposed by Ringeval et al. [<xref rid="ref012" ref-type="bibr">12</xref>] can not only track the movement of facial key points but also analyze micro-expression changes and emotional transition processes. This study found that depressed patients have specific features in the dynamic change patterns of facial expressions, such as delayed expression transitions and the duration of emotional expressions. Facial expression analysis can also be combined with temporal information for dynamic modeling.</p>
      </sec>
      <sec id="S2.SS3">
        <label>2.3</label>
        <title>Physiological Signal Analysis</title>
        <p id="S2.SS3.p1">Physiological signal analysis provides an important basis for the objective assessment of depression. Cai et al. [<xref rid="ref014" ref-type="bibr">14</xref>] demonstrated a depression detection method based on electroencephalogram (EEG) signals and verified the value of physiological signal analysis in depression diagnosis. Jacobson et al. [<xref rid="ref015" ref-type="bibr">15</xref>] explored the application of digital biomarkers in emotional disorder assessment and demonstrated the advantages of wearable devices in continuously monitoring physiological indicators. These studies provide new technical means for the dynamic assessment of depressive states.</p>
      </sec>
      <sec id="S2.SS4">
        <label>2.4</label>
        <title>Social Media Behavior Analysis</title>
        <p id="S2.SS4.p1">Social media behavior analysis provides a new research perspective for depression detection. De Choudhury et al. [<xref rid="ref011" ref-type="bibr">11</xref>] developed a depression prediction model based on social media by analyzing users' posting content, interaction behavior, and activity patterns. Tadesse et al. [<xref rid="ref017" ref-type="bibr">17</xref>] specifically studied depression-related posts on the Reddit platform and achieved high detection accuracy through natural language processing technology, verifying the feasibility of social media data analysis in depression screening. Guntuku et al. [<xref rid="ref016" ref-type="bibr">16</xref>] systematically summarized the application of social media data in mental health assessment and demonstrated the potential of this method in early depression screening. Additionally, De Choudhury et al. [<xref rid="ref029" ref-type="bibr">29</xref>] explored the use of social media as a measurement tool for assessing depression prevalence in populations, demonstrating its potential for large-scale mental health monitoring through behavioral pattern analysis.</p>
      </sec>
      <sec id="S2.SS5">
        <label>2.5</label>
        <title>Multimodal Feature Fusion</title>
        <p id="S2.SS5.p1">Multimodal feature fusion aims to comprehensively utilize feature information from different sources to improve the comprehensiveness and accuracy of detection. From an algorithmic perspective, multimodal feature fusion mainly includes two strategies: feature-level fusion and decision-level fusion. Yang et al. [<xref rid="ref018" ref-type="bibr">18</xref>] proposed a decision-level fusion method based on decision trees. This method first processes and predicts audio, visual, and language features separately, and then integrates the PHQ - 8 score prediction results of each modality and participant features (such as PTSD diagnosis, sleep status, etc.) through a gender - specific decision tree model [<xref rid="ref018" ref-type="bibr">18</xref>].</p>
      </sec>
      <sec id="S2.SS6">
        <label>2.6</label>
        <title>Datasets for Depression Research</title>
        <p id="S2.SS6.p1">The development of AI-based depression detection systems relies heavily on high-quality, diverse datasets. The DAIC-WOZ dataset [<xref rid="ref040" ref-type="bibr">40</xref>] contains clinical interviews with 189 participants, providing audio, video, and text data along with PHQ-8 depression scores, serving as a gold standard for multimodal depression research. The AVEC challenge datasets [<xref rid="ref012" ref-type="bibr">12</xref>] offer annually updated resources with synchronized audio-visual recordings specifically designed for affective computing research.</p>
        <p>
          <table-wrap id="T1">
            <label>Table 1</label>
            <caption>
              <p>Commonly used depression prediction datasets.</p>
            </caption>
            <table>
              <thead>
                <tr>
                  <th style="border-top: 1px solid black;" align="left">
                    <bold>Dataset Name</bold>
                  </th>
                  <th style="border-top: 1px solid black;" align="left">
                    <bold>Data Type</bold>
                  </th>
                  <th style="border-top: 1px solid black;" align="justify">
                    <p>
                      <bold>Key Features</bold>
                    </p>
                  </th>
                </tr>
              </thead>
              <tbody>
                <tr>
                  <td style="border-top: 1px solid black;" align="left">DAIC-WOZ</td>
                  <td style="border-top: 1px solid black;" align="left">Audio/Video/Text</td>
                  <td style="border-top: 1px solid black;" align="justify">
                    <p>189 participants in clinical interviews with PHQ-8 depression scores</p>
                  </td>
                </tr>
                <tr>
                  <td align="left">AVEC Challenge</td>
                  <td align="left">Multimodal</td>
                  <td align="justify">
                    <p>Annual competition datasets with synchronized audio-visual recordings for affective computing research</p>
                  </td>
                </tr>
                <tr>
                  <td align="left">D-VLOG</td>
                  <td align="left">Video/Audio</td>
                  <td align="justify">
                    <p>961 vlogs (160+ hours) from 816 subjects in naturalistic settings</p>
                  </td>
                </tr>
                <tr>
                  <td align="left">Pittsburgh Depression</td>
                  <td align="left">Audio/Video</td>
                  <td align="justify">
                    <p>49 subjects with 130 samples from clinical interviews</p>
                  </td>
                </tr>
                <tr>
                  <td align="left">MODMA</td>
                  <td align="left">EEG signals</td>
                  <td align="justify">
                    <p>53-channel EEG recordings from clinically diagnosed patients and matched controls</p>
                  </td>
                </tr>
                <tr>
                  <td align="left">DEAP</td>
                  <td align="left">EEG/Physiological</td>
                  <td align="justify">
                    <p>EEG and peripheral physiological signals from 32 participants during emotion elicitation</p>
                  </td>
                </tr>
                <tr>
                  <td align="left">SEED</td>
                  <td align="left">EEG</td>
                  <td align="justify">
                    <p>EEG data collected specifically for emotion and depression analysis</p>
                  </td>
                </tr>
                <tr>
                  <td align="left">eRisk CLEF</td>
                  <td align="left">Text</td>
                  <td align="justify">
                    <p>Early risk detection data from social media platforms</p>
                  </td>
                </tr>
                <tr>
                  <td align="left">Reddit Depression Corpus</td>
                  <td align="left">Text</td>
                  <td align="justify">
                    <p>User posts from depression-related subreddits with temporal information</p>
                  </td>
                </tr>
                <tr>
                  <td align="left">Depression Twitter</td>
                  <td align="left">Social media text</td>
                  <td align="justify">
                    <p>Large-scale collection of tweets with depression indicators</p>
                  </td>
                </tr>
                <tr>
                  <td align="left">EmotionLines</td>
                  <td align="left">Text</td>
                  <td align="justify">
                    <p>29,245 labeled utterances from 2,000 dialogues with emotion annotations</p>
                  </td>
                </tr>
                <tr>
                  <td align="left">IEMOCAP</td>
                  <td align="left">Audio/Video/Motion</td>
                  <td align="justify">
                    <p>Interactive Emotional Dyadic Motion Capture Database with depression-relevant emotional states</p>
                  </td>
                </tr>
                <tr>
                  <td style="border-bottom: 1px solid black;" align="left">MELD</td>
                  <td style="border-bottom: 1px solid black;" align="left">Multimodal</td>
                  <td style="border-bottom: 1px solid black;" align="justify">
                    <p>Multimodal EmotionLines Dataset with rich contextual information</p>
                  </td>
                </tr>
              </tbody>
            </table>
          </table-wrap>
        </p>
        <p id="S2.SS6.p2">As summarized in Table <xref rid="T1" ref-type="table">1</xref>, more recently, the D-VLOG dataset [<xref rid="ref031" ref-type="bibr">31</xref>] has emerged as a valuable resource, containing 961 video blogs (approximately 160 hours) from 816 subjects in naturalistic settings, offering a more ecologically valid alternative to laboratory-collected data. The Pittsburgh Depression Dataset [<xref rid="ref032" ref-type="bibr">32</xref>] includes 49 subjects with 130 audio-visual samples recorded during clinical interviews, with expert annotations of depression severity.</p>
        <p id="S2.SS6.p3">For physiological signal analysis, several datasets are available: MODMA [<xref rid="ref041" ref-type="bibr">41</xref>] provides 53-channel EEG recordings from clinically diagnosed patients and matched controls; DEAP [<xref rid="ref033" ref-type="bibr">33</xref>] contains EEG and peripheral physiological signals from 32 participants during emotion elicitation; and SEED [<xref rid="ref034" ref-type="bibr">34</xref>] offers EEG data specifically collected for emotion and depression analysis.</p>
        <p id="S2.SS6.p4">Text-based depression research is supported by datasets such as the eRisk CLEF collection [<xref rid="ref035" ref-type="bibr">35</xref>], which contains early risk detection data from social media platforms; the Reddit Depression Corpus [<xref rid="ref042" ref-type="bibr">42</xref>], which includes user posts from depression-related subreddits; and various Twitter depression datasets [<xref rid="ref036" ref-type="bibr">36</xref>] that enable large-scale analysis of linguistic patterns.</p>
        <p id="S2.SS6.p5">Emotion-focused datasets that support depression research include EmotionLines [<xref rid="ref037" ref-type="bibr">37</xref>] (containing 29,245 labeled utterances from 2,000 dialogues), IEMOCAP [<xref rid="ref038" ref-type="bibr">38</xref>] (Interactive Emotional Dyadic Motion Capture Database), and MELD [<xref rid="ref039" ref-type="bibr">39</xref>] (Multimodal EmotionLines Dataset), which provide emotional annotations valuable for understanding affective patterns in depression.</p>
        <p id="S2.SS6.p6">These diverse datasets collectively address different aspects of depression manifestation while presenting unique challenges in data quality, annotation consistency, and ethical considerations. Researchers must carefully select appropriate datasets based on their specific research questions and methodological approaches.</p>
      </sec>
    </sec>
    <sec id="S3">
      <label>3.</label>
      <title>Algorithm Research on Intelligent Depression Detection</title>
      <p id="S3.p1">The evolution of intelligent depression detection algorithms has paralleled the broader advancements in machine learning technology, progressing from conventional machine learning approaches to sophisticated deep learning architectures. Each algorithmic paradigm offers distinct advantages in feature extraction, pattern recognition, and predictive classification, collectively providing a robust technical foundation for depression detection systems [<xref rid="ref020" ref-type="bibr">20</xref>].</p>
      <sec id="S3.SS1">
        <label>3.1</label>
        <title>Traditional Machine Learning Methods</title>
        <p id="S3.SS1.p1">Conventional machine learning algorithms established the methodological groundwork for early depression detection research. Yang et al. [<xref rid="ref001" ref-type="bibr">1</xref>] demonstrated the efficacy of support vector machines (SVM) in depression diagnosis using electroencephalography (EEG) data. Their approach employed a radial basis kernel function to effectively map high-dimensional, non-linear EEG features into a more discriminative feature space. Through systematic grid search optimization of the penalty factor C and kernel parameter <inline-formula><mml:math alttext="\gamma" display="inline"><mml:mi>γ</mml:mi></mml:math></inline-formula>, they achieved an optimal balance between model complexity and generalization capability.</p>
        <p id="S3.SS1.p2">In another significant application, Sau and Bhakta [<xref rid="ref028" ref-type="bibr">28</xref>] implemented artificial neural networks (ANN) to predict depression among elderly residents in resource-limited settings in Kolkata, India. Their innovative approach integrated standardized clinical assessment tools with multimodal feature analysis. To overcome challenges associated with feature heterogeneity, they developed a hierarchical encoding framework that normalized diverse input data types and applied one-hot encoding where appropriate. Their model architecture incorporated dropout regularization to mitigate overfitting risks, resulting in a computationally efficient prediction system particularly suited for community-based mental health screening programs. Furthermore, a meta-analysis by Lee et al. [<xref rid="ref027" ref-type="bibr">27</xref>] systematically reviewed the application of machine learning algorithms in predicting therapeutic outcomes for depression, highlighting their potential in enhancing clinical decision-making by identifying effective treatment strategies.</p>
      </sec>
      <sec id="S3.SS2">
        <label>3.2</label>
        <title>Deep Learning Methods</title>
        <p id="S3.SS2.p1">Deep learning has transformed depression detection through its powerful feature extraction and representation learning capabilities. Low et al. [<xref rid="ref010" ref-type="bibr">10</xref>] advanced this field by applying BERT (Bidirectional Encoder Representations from Transformers) to analyze linguistic patterns in social media content. Their approach fine-tuned pre-trained language models on depression-specific texts, enabling the identification of subtle linguistic markers of depression that traditional methods might miss. The contextual understanding provided by these models significantly improved detection accuracy over conventional techniques. Guntuku et al. [<xref rid="ref013" ref-type="bibr">13</xref>] further developed specialized frameworks for mental health monitoring on social platforms. Their approach combined convolutional neural networks (CNNs) with recurrent neural networks (RNNs) to capture both semantic content and temporal patterns in user-generated text. This architecture effectively detected changes in linguistic patterns that often precede clinical depression symptoms.</p>
        <p id="S3.SS2.p2">Multimodal deep learning has shown particular promise by integrating diverse data sources—audio, visual, and physiological signals—to create comprehensive representations of depressive symptoms. Yang et al. [<xref rid="ref018" ref-type="bibr">18</xref>] demonstrated that neural networks could effectively combine features from speech, facial expressions, and interview content, outperforming single-modality approaches. Attention mechanisms represent another significant advancement, enhancing both model performance and interpretability. Knyazev et al. [<xref rid="ref021" ref-type="bibr">21</xref>] provided insights into how these mechanisms improve generalization in neural networks, with important implications for depression detection systems. These approaches not only improve accuracy but also help clinicians understand which features influence the model's decisions—critical for clinical adoption. The ongoing development of explainable AI techniques continues to improve transparency, building trust and facilitating integration of these tools into clinical practice.</p>
      </sec>
      <sec id="S3.SS3">
        <label>3.3</label>
        <title>Algorithm Optimization and Improvement</title>
        <p id="S3.SS3.p1">To address clinical implementation challenges, researchers have pursued targeted algorithm optimizations that balance performance with practical constraints. Roy et al. [<xref rid="ref022" ref-type="bibr">22</xref>] conducted a comprehensive review of deep learning applications in EEG signal analysis, identifying key optimization strategies for neural network architectures processing neurophysiological data. Their analysis highlighted the importance of temporal convolutional networks and recurrent architectures in capturing the dynamic patterns characteristic of depression-related EEG signals, while also emphasizing computational efficiency considerations essential for real-time monitoring applications.</p>
        <p id="S3.SS3.p2">Kingphai and Moshfeghi [<xref rid="ref023" ref-type="bibr">23</xref>] extended these optimization approaches to mental workload assessment using EEG signals, demonstrating novel regularization techniques and parameter optimization methods that significantly improved model robustness across diverse patient populations. Their work introduced adaptive learning rate schedules and domain-specific data augmentation strategies that effectively addressed the data scarcity challenges common in clinical depression research. These optimization techniques have broader implications for depression detection systems, particularly in resource-constrained environments where computational efficiency and model reliability are paramount concerns.</p>
      </sec>
    </sec>
    <sec id="S4">
      <label>4.</label>
      <title>Existing Problems and Challenges</title>
      <p id="S4.p1">Despite significant advancements in AI-based depression detection, several critical challenges remain across multiple domains including data management, technological development, clinical integration, and ethical considerations [<xref rid="ref020" ref-type="bibr">20</xref>, <xref rid="ref024" ref-type="bibr">24</xref>, <xref rid="ref025" ref-type="bibr">25</xref>]. Addressing these challenges is essential for translating research innovations into practical clinical applications that can meaningfully impact mental healthcare delivery.</p>
      <sec id="S4.SS1">
        <label>4.1</label>
        <title>Data Quality and Standardization Issues</title>
        <p id="S4.SS1.p1">High-quality, standardized data represents the cornerstone of effective intelligent detection systems. Larson et al. [<xref rid="ref024" ref-type="bibr">24</xref>] identified several fundamental challenges in medical data acquisition and utilization that directly impact model performance and clinical validity. These include inconsistent data collection protocols, variable annotation standards, and limited representation of diverse patient populations.</p>
        <p id="S4.SS1.p2">To overcome these limitations, researchers advocate for establishing unified acquisition standards and rigorous quality control processes. This approach necessitates the formation of specialized annotation teams working with comprehensive guidelines and protocols. Implementing multi-rater annotation procedures with formal consistency assessments significantly enhances the reliability of training data, which directly translates to improved model performance in clinical applications.</p>
      </sec>
      <sec id="S4.SS2">
        <label>4.2</label>
        <title>Technical Reliability and Robustness</title>
        <p id="S4.SS2.p1">The reliability and robustness of depression detection technologies remain significant concerns for real-world deployment. In their comprehensive review, Shatte et al. [<xref rid="ref020" ref-type="bibr">20</xref>] highlighted three primary technical challenges: limited model generalization across diverse populations and clinical contexts, heightened sensitivity to data noise and environmental interference, and insufficient stability in multimodal feature fusion frameworks. These limitations can lead to inconsistent performance in naturalistic settings, potentially undermining clinical trust and adoption.</p>
      </sec>
      <sec id="S4.SS3">
        <label>4.3</label>
        <title>Clinical Practice and Application Issues</title>
        <p id="S4.SS3.p1">Bridging the gap between technological innovation and clinical implementation presents multifaceted challenges. Kellogg and Sadeh-Sharvit [<xref rid="ref030" ref-type="bibr">30</xref>] emphasized that successful clinical integration depends on addressing model interpretability, seamless workflow integration, and real-time performance capabilities. Clinicians require systems that not only provide accurate assessments but also offer transparent reasoning that aligns with established clinical frameworks. Well-designed AI-assisted diagnosis systems have demonstrated potential to enhance diagnostic efficiency and accuracy while providing reliable decision support, but their implementation requires careful attention to clinical workflows and practitioner needs.</p>
      </sec>
      <sec id="S4.SS4">
        <label>4.4</label>
        <title>Ethical Privacy and Security Issues</title>
        <p id="S4.SS4.p1">The deployment of intelligent detection systems raises significant ethical considerations regarding patient privacy and data security. Larson et al. [<xref rid="ref024" ref-type="bibr">24</xref>] proposed a comprehensive framework for protecting patient privacy in medical data utilization, emphasizing that technological innovation must proceed in parallel with robust ethical safeguards. This framework highlights the importance of informed consent, data minimization principles, and transparent algorithmic processes. Establishing sound ethical codes and privacy protection mechanisms not only safeguards patient rights but also significantly influences social acceptance and long-term technological adoption.</p>
      </sec>
      <sec id="S4.SS5">
        <label>4.5</label>
        <title>Cost-Benefit and Implementation Considerations</title>
        <p id="S4.SS5.p1">The widespread adoption of intelligent detection technologies necessitates careful evaluation of economic factors and implementation requirements. Huckvale et al. [<xref rid="ref026" ref-type="bibr">26</xref>] advocated for comprehensive assessment frameworks that consider system development and maintenance costs, infrastructure requirements, training needs, and measurable clinical outcomes. Successful implementation strategies must balance technological capabilities with resource constraints, particularly in diverse healthcare settings. This requires establishing unified data standards, developing robust and interpretable algorithms, conducting rigorous clinical validation studies, implementing appropriate ethical safeguards, and creating evidence-based evaluation systems to demonstrate clinical and economic value [<xref rid="ref020" ref-type="bibr">20</xref>, <xref rid="ref024" ref-type="bibr">24</xref>, <xref rid="ref025" ref-type="bibr">25</xref>].</p>
      </sec>
    </sec>
    <sec id="S5">
      <label>5.</label>
      <title>Future Development Trends</title>
      <p id="S5.p1">Based on the analysis of the current research status of intelligent depression detection, this section will explore the future development trends of this field from three dimensions: technological innovation, multidisciplinary integration, and clinical practice.</p>
      <sec id="S5.SS1">
        <label>5.1</label>
        <title>Technological Innovation Directions</title>
        <p id="S5.SS1.p1">The continuous innovation of artificial intelligence technology will bring new development opportunities for intelligent depression detection. In terms of data sharing and privacy protection, Graham et al. [<xref rid="ref025" ref-type="bibr">25</xref>] emphasized the need to pay special attention to privacy protection, data security, and algorithmic bias when applying AI in the mental health field. Larson et al. [<xref rid="ref024" ref-type="bibr">24</xref>] proposed an ethical framework for medical data sharing, emphasizing that data sharing should be part of clinical research rather than a commercial transaction.</p>
      </sec>
      <sec id="S5.SS2">
        <label>5.2</label>
        <title>Multidisciplinary Cross - Integration</title>
        <p id="S5.SS2.p1">The development of intelligent depression detection requires the deep integration of multidisciplinary knowledge. The guidance of psychological theories is crucial for improving the scientific nature of detection models. Shatte et al. [<xref rid="ref020" ref-type="bibr">20</xref>] pointed out in a review study that combining psychological theories with machine learning methods can significantly improve the clinical practicality and interpretability of models. Cummins et al. [<xref rid="ref019" ref-type="bibr">19</xref>] presented a depression detection method based on voice analysis in a systematic review and proposed a complete technical roadmap from voice features to clinical application. Combining this method with traditional clinical assessments can provide more objective diagnostic evidence.</p>
      </sec>
      <sec id="S5.SS3">
        <label>5.3</label>
        <title>Clinical Practice Promotion</title>
        <p id="S5.SS3.p1">Promoting the clinical application of intelligent detection technology is a key direction for future development. Graham et al. [<xref rid="ref025" ref-type="bibr">25</xref>] showed that the application of artificial intelligence in the mental health field requires a multi - dimensional evaluation system. Technical verification should be carried out in a single - center to evaluate the basic performance and usability of the system. Through multi - center collaborative research, the applicability and consistency of the technology in different scenarios should be verified, and a standardized clinical application process and quality control system should be established.</p>
      </sec>
    </sec>
    <sec id="S6">
      <label>6.</label>
      <title>Conclusion and Outlook</title>
      <p id="S6.p1">This paper systematically reviewed the research status, key technologies, and future trends in the field of intelligent depression detection. Current research still faces the following main challenges: difficulty in obtaining high - quality data, the need to improve the stability and generalization ability of algorithm models, and practical problems such as system integration and workflow adaptation in clinical practice. The development of intelligent depression detection will pay more attention to technological innovation, clinical practice, and ethical norms.</p>
      <p id="S6.p2">Achieving these goals requires the continuous efforts and in - depth collaboration of all parties in industry, academia, research, and medicine. Through the organic combination of technological innovation and clinical practice, intelligent detection technology will surely play a greater role in the early screening, diagnosis, assessment, and prognosis monitoring of depression, and make positive contributions to improving the diagnosis and treatment level of depression and improving patient prognosis.</p>
    </sec>
  </body>
  <back>
    <ack>
      <title>Acknowledgments</title>
      <p id="ack.p1">This work was supported by the Key Project of Natural Science Foundation of Xinjiang Hetian College under Grant 2025ZR002.</p>
    </ack>
    <sec id="sec0100" sec-type="COI-statement">
      <title>Conflict of interest</title>
      <p>The authors declare no conflicts of interest.</p>
    </sec>
    <ref-list>
      <title>References</title>
      <ref id="ref001">
        <label>[1]</label>
        <mixed-citation> Yang, C. Y., &amp; Chen, Y. Z. (2024). Support vector machine classification of patients with depression based on resting-state electroencephalography. <italic>Asian Biomedicine: Research, Reviews and News, 18</italic>(5), 212-223. [<uri>https://doi.org/10.2478/abm-2024-0029</uri>] </mixed-citation>
      </ref>
      <ref id="ref002">
        <label>[2]</label>
        <mixed-citation> Su, C., Xu, Z., Pathak, J., &amp; Wang, F. (2020). Deep learning in mental health outcome research: a scoping review. <italic>Translational psychiatry, 10</italic>(1), 116. [<uri>https://doi.org/10.1038/s41398-020-0780-3</uri>] </mixed-citation>
      </ref>
      <ref id="ref003">
        <label>[3]</label>
        <mixed-citation> World Health Organization. (2022). <italic>World mental health report: Transforming mental health for all</italic>. WHO Press. </mixed-citation>
      </ref>
      <ref id="ref004">
        <label>[4]</label>
        <mixed-citation> Santomauro, D. F., Herrera, A. M. M., Shadid, J., Zheng, P., Ashbaugh, C., Pigott, D. M., … &amp; Ferrari, A. J. (2021). Global prevalence and burden of depressive and anxiety disorders in 204 countries and territories in 2020 due to the COVID-19 pandemic. <italic>The Lancet, 398</italic>(10312), 1700-1712. [<uri>https://doi.org/10.1016/S0140-6736(21)02143-7</uri>] </mixed-citation>
      </ref>
      <ref id="ref005">
        <label>[5]</label>
        <mixed-citation> Wang, J., Zhao, Z., Yang, J., Wang, L., Zhang, M., &amp; Zhou, M. (2024). The Association Between Depression and All-Cause, Cause-Specific Mortality in the Chinese Population—China, 2010–2022. <italic>China CDC Weekly, 6</italic>(40), 1022. [<uri>https://doi.org/10.46234/ccdcw2024.212</uri>] </mixed-citation>
      </ref>
      <ref id="ref006">
        <label>[6]</label>
        <mixed-citation> The Lancet Psychiatry. (2024). Global burden of disease 2021: Mental health messages. <italic>The Lancet Psychiatry, 11</italic>(8), 573. [<uri>https://doi.org/10.1016/S2215-0366(24)00222-0</uri>] </mixed-citation>
      </ref>
      <ref id="ref007">
        <label>[7]</label>
        <mixed-citation> Chinese Medical Association, Chinese Medical Association Publishing House, &amp; Chinese Medical Association General Practice Branch. (2021). Guidelines for the diagnosis and treatment of depression in primary care (2021). <italic>Chinese Journal of General Practitioners, 20</italic>(12), 1101–1116. [<uri>https://doi.org/10.3760/cma.j.cn114798-20211020-00778</uri>] </mixed-citation>
      </ref>
      <ref id="ref008">
        <label>[8]</label>
        <mixed-citation> Torous, J., Myrick, K. J., Rauseo-Ricupero, N., &amp; Firth, J. (2020). Digital mental health and COVID-19: using technology today to accelerate the curve on access and quality tomorrow. <italic>JMIR mental health, 7</italic>(3), e18848. [<uri>https://doi.org/10.2196/18848</uri>] </mixed-citation>
      </ref>
      <ref id="ref009">
        <label>[9]</label>
        <mixed-citation> Sadeghi, M., Richer, R., Egger, B., Schindler-Gmelch, L., Rupp, L. H., Rahimi, F., … &amp; Eskofier, B. M. (2024). Harnessing multimodal approaches for depression detection using large language models and facial expressions. <italic>npj Mental Health Research, 3</italic>(1), 66. [<uri>https://doi.org/10.1038/s44184-024-00112-8</uri>] </mixed-citation>
      </ref>
      <ref id="ref010">
        <label>[10]</label>
        <mixed-citation> Low, D. M., Rumker, L., Talkar, T., Torous, J., Cecchi, G., &amp; Ghosh, S. S. (2020). Natural language processing reveals vulnerable mental health support groups and heightened health anxiety on reddit during covid-19: Observational study. <italic>Journal of medical Internet research, 22</italic>(10), e22635. [<uri>https://doi.org/10.2196/22635</uri>] </mixed-citation>
      </ref>
      <ref id="ref011">
        <label>[11]</label>
        <mixed-citation> De Choudhury, M., Gamon, M., Counts, S., &amp; Horvitz, E. (2013). Predicting depression via social media. In <italic>Proceedings of the international AAAI conference on web and social media</italic> (Vol. 7, No. 1, pp. 128-137). [<uri>https://doi.org/10.1609/icwsm.v7i1.14432</uri>] </mixed-citation>
      </ref>
      <ref id="ref012">
        <label>[12]</label>
        <mixed-citation> Ringeval, F., Schuller, B., Valstar, M., Cummins, N., Cowie, R., Tavabi, L., … &amp; Pantic, M. (2019, October). AVEC 2019 workshop and challenge: state-of-mind, detecting depression with AI, and cross-cultural affect recognition. In <italic>Proceedings of the 9th International on Audio/visual Emotion Challenge and Workshop</italic> (pp. 3-12). [<uri>https://doi.org/10.1145/3347320.3357688</uri>] </mixed-citation>
      </ref>
      <ref id="ref013">
        <label>[13]</label>
        <mixed-citation> Guntuku, S. C., Sherman, G., Stokes, D. C., Agarwal, A. K., Seltzer, E., Merchant, R. M., &amp; Ungar, L. H. (2020). Tracking mental health and symptom mentions on Twitter during COVID-19. <italic>Journal of general internal medicine, 35</italic>(9), 2798-2800. [<uri>https://doi.org/10.1007/s11606-020-05988-8</uri>] </mixed-citation>
      </ref>
      <ref id="ref014">
        <label>[14]</label>
        <mixed-citation> Cai, H., Han, J., Chen, Y., Sha, X., Wang, Z., Hu, B., … &amp; Gutknecht, J. (2018). A pervasive approach to EEG-based depression detection. <italic>complexity, 2018</italic>(1), 5238028. [<uri>https://doi.org/10.1155/2018/5238028</uri>] </mixed-citation>
      </ref>
      <ref id="ref015">
        <label>[15]</label>
        <mixed-citation> Jacobson, N. C., Weingarden, H., &amp; Wilhelm, S. (2019). Digital biomarkers of mood disorders and symptom change. <italic>NPJ digital medicine, 2</italic>(1), 3. [<uri>https://doi.org/10.1038/s41746-019-0078-0</uri>] </mixed-citation>
      </ref>
      <ref id="ref016">
        <label>[16]</label>
        <mixed-citation> Guntuku, S. C., Yaden, D. B., Kern, M. L., Ungar, L. H., &amp; Eichstaedt, J. C. (2017). Detecting depression and mental illness on social media: an integrative review. <italic>Current Opinion in Behavioral Sciences, 18</italic>, 43-49. [<uri>https://doi.org/10.1016/j.cobeha.2017.07.005</uri>] </mixed-citation>
      </ref>
      <ref id="ref017">
        <label>[17]</label>
        <mixed-citation> Tadesse, M. M., Lin, H., Xu, B., &amp; Yang, L. (2019). Detection of depression-related posts in reddit social media forum. <italic>Ieee Access, 7</italic>, 44883-44893. [<uri>https://doi.org/10.1109/ACCESS.2019.2909180</uri>] </mixed-citation>
      </ref>
      <ref id="ref018">
        <label>[18]</label>
        <mixed-citation> Yang, L., Jiang, D., He, L., Pei, E., Oveneke, M. C., &amp; Sahli, H. (2016, October). Decision tree based depression classification from audio video and language information. In <italic>Proceedings of the 6th international workshop on audio/visual emotion challenge</italic> (pp. 89-96). [<uri>https://doi.org/10.1145/2988257.2988269</uri>] </mixed-citation>
      </ref>
      <ref id="ref019">
        <label>[19]</label>
        <mixed-citation> Cummins, N., Scherer, S., Krajewski, J., Schnieder, S., Epps, J., &amp; Quatieri, T. F. (2015). A review of depression and suicide risk assessment using speech analysis. <italic>Speech communication, 71</italic>, 10-49. [<uri>https://doi.org/10.1016/j.specom.2015.03.004</uri>] </mixed-citation>
      </ref>
      <ref id="ref020">
        <label>[20]</label>
        <mixed-citation> Shatte, A. B., Hutchinson, D. M., &amp; Teague, S. J. (2019). Machine learning in mental health: a scoping review of methods and applications. <italic>Psychological medicine, 49</italic>(9), 1426-1448. [<uri>https://doi.org/10.1017/S0033291719000151</uri>] </mixed-citation>
      </ref>
      <ref id="ref021">
        <label>[21]</label>
        <mixed-citation> Knyazev, B., Taylor, G. W., &amp; Amer, M. (2019). Understanding attention and generalization in graph neural networks. <italic>Advances in neural information processing systems, 32</italic>. </mixed-citation>
      </ref>
      <ref id="ref022">
        <label>[22]</label>
        <mixed-citation> Roy, Y., Banville, H., Albuquerque, I., Gramfort, A., Falk, T. H., &amp; Faubert, J. (2019). Deep learning-based electroencephalography analysis: a systematic review. <italic>Journal of neural engineering, 16</italic>(5), 051001. [<uri>https://doi.org/10.1088/1741-2552/ab260c</uri>] </mixed-citation>
      </ref>
      <ref id="ref023">
        <label>[23]</label>
        <mixed-citation> Kingphai, K., &amp; Moshfeghi, Y. (2024). Mental workload assessment using deep learning models from eeg signals: A systematic review. <italic>IEEE Transactions on Cognitive and Developmental Systems</italic>. [<uri>https://doi.org/10.1109/TCDS.2024.3460750</uri>] </mixed-citation>
      </ref>
      <ref id="ref024">
        <label>[24]</label>
        <mixed-citation> Larson, D. B., Magnus, D. C., Lungren, M. P., Shah, N. H., &amp; Langlotz, C. P. (2020). Ethics of using and sharing clinical imaging data for artificial intelligence: a proposed framework. <italic>Radiology, 295</italic>(3), 675-682. [<uri>https://doi.org/10.1148/radiol.2020192536</uri>] </mixed-citation>
      </ref>
      <ref id="ref025">
        <label>[25]</label>
        <mixed-citation> Graham, S., Depp, C., Lee, E. E., Nebeker, C., Tu, X., Kim, H. C., &amp; Jeste, D. V. (2019). Artificial intelligence for mental health and mental illnesses: an overview. <italic>Current psychiatry reports, 21</italic>(11), 116. [<uri>https://doi.org/10.1007/s11920-019-1094-0</uri>] </mixed-citation>
      </ref>
      <ref id="ref026">
        <label>[26]</label>
        <mixed-citation> Huckvale, K., Venkatesh, S., &amp; Christensen, H. (2019). Toward clinical digital phenotyping: a timely opportunity to consider purpose, quality, and safety. <italic>NPJ digital medicine, 2</italic>(1), 88. [<uri>https://doi.org/10.1038/s41746-019-0166-1</uri>] </mixed-citation>
      </ref>
      <ref id="ref027">
        <label>[27]</label>
        <mixed-citation> Lee, Y., Ragguett, R. M., Mansur, R. B., Boutilier, J. J., Rosenblat, J. D., Trevizol, A., … &amp; McIntyre, R. S. (2018). Applications of machine learning algorithms to predict therapeutic outcomes in depression: a meta-analysis and systematic review. <italic>Journal of affective disorders, 241</italic>, 519-532. [<uri>https://doi.org/10.1016/j.jad.2018.08.073</uri>] </mixed-citation>
      </ref>
      <ref id="ref028">
        <label>[28]</label>
        <mixed-citation> Sau, A., &amp; Bhakta, I. (2017). Artificial neural network (ANN) model to predict depression among geriatric population at a slum in Kolkata, India. <italic>Journal of clinical and diagnostic research: JCDR, 11</italic>(5), VC01. [<uri>https://doi.org/10.7860/JCDR/2017/23656.9762</uri>] </mixed-citation>
      </ref>
      <ref id="ref029">
        <label>[29]</label>
        <mixed-citation> De Choudhury, M., Counts, S., &amp; Horvitz, E. (2013, May). Social media as a measurement tool of depression in populations. In <italic>Proceedings of the 5th annual ACM web science conference</italic> (pp. 47-56). [<uri>https://doi.org/10.1145/2464464.2464480</uri>] </mixed-citation>
      </ref>
      <ref id="ref030">
        <label>[30]</label>
        <mixed-citation> Kellogg, K. C., &amp; Sadeh-Sharvit, S. (2022). Pragmatic AI-augmentation in mental healthcare: key technologies, potential benefits, and real-world challenges and solutions for frontline clinicians. <italic>Frontiers in psychiatry, 13</italic>, 990370. [<uri>https://doi.org/10.3389/fpsyt.2022.990370</uri>] </mixed-citation>
      </ref>
      <ref id="ref031">
        <label>[31]</label>
        <mixed-citation> Yoon, J., Kang, C., Kim, S., &amp; Han, J. (2022, June). D-vlog: Multimodal vlog dataset for depression detection. In <italic>Proceedings of the AAAI Conference on Artificial Intelligence</italic> (Vol. 36, No. 11, pp. 12226-12234). [<uri>https://doi.org/10.1609/aaai.v36i11.21483</uri>] </mixed-citation>
      </ref>
      <ref id="ref032">
        <label>[32]</label>
        <mixed-citation> Girard, J. M., Cohn, J. F., Mahoor, M. H., Mavadati, S. M., Hammal, Z., &amp; Rosenwald, D. P. (2014). Nonverbal social withdrawal in depression: Evidence from manual and automatic analyses. <italic>Image and vision computing, 32</italic>(10), 641-647. [<uri>https://doi.org/10.1016/j.imavis.2013.12.007</uri>] </mixed-citation>
      </ref>
      <ref id="ref033">
        <label>[33]</label>
        <mixed-citation> Koelstra, S., Muhl, C., Soleymani, M., Lee, J. S., Yazdani, A., Ebrahimi, T., … &amp; Patras, I. (2011). Deap: A database for emotion analysis; using physiological signals. <italic>IEEE transactions on affective computing, 3</italic>(1), 18-31. [<uri>https://doi.org/10.1109/T-AFFC.2011.15</uri>] </mixed-citation>
      </ref>
      <ref id="ref034">
        <label>[34]</label>
        <mixed-citation> Zheng, W. L., &amp; Lu, B. L. (2015). Investigating critical frequency bands and channels for EEG-based emotion recognition with deep neural networks. <italic>IEEE Transactions on autonomous mental development, 7</italic>(3), 162-175. [<uri>https://doi.org/10.1109/TAMD.2015.2431497</uri>] </mixed-citation>
      </ref>
      <ref id="ref035">
        <label>[35]</label>
        <mixed-citation> Losada, D. E., &amp; Crestani, F. (2016, August). A test collection for research on depression and language use. In <italic>International conference of the cross-language evaluation forum for European languages</italic> (pp. 28-39). Cham: Springer International Publishing. [<uri>https://doi.org/10.1007/978-3-319-44564-9_3</uri>] </mixed-citation>
      </ref>
      <ref id="ref036">
        <label>[36]</label>
        <mixed-citation> Coppersmith, G., Dredze, M., Harman, C., &amp; Hollingshead, K. (2015). From ADHD to SAD: Analyzing the language of mental health on Twitter through self-reported diagnoses. In <italic>Proceedings of the 2nd workshop on computational linguistics and clinical psychology: from linguistic signal to clinical reality</italic> (pp. 1-10). </mixed-citation>
      </ref>
      <ref id="ref037">
        <label>[37]</label>
        <mixed-citation> Chen, S. Y., Hsu, C. C., Kuo, C. C., &amp; Ku, L. W. (2018). Emotionlines: An emotion corpus of multi-party conversations. <italic>arXiv preprint arXiv:1802.08379</italic>. </mixed-citation>
      </ref>
      <ref id="ref038">
        <label>[38]</label>
        <mixed-citation> Busso, C., Bulut, M., Lee, C. C., Kazemzadeh, A., Mower, E., Kim, S., … &amp; Narayanan, S. S. (2008). IEMOCAP: Interactive emotional dyadic motion capture database. <italic>Language resources and evaluation, 42</italic>(4), 335-359. [<uri>https://doi.org/10.1007/s10579-008-9076-6</uri>] </mixed-citation>
      </ref>
      <ref id="ref039">
        <label>[39]</label>
        <mixed-citation> Poria, S., Hazarika, D., Majumder, N., Naik, G., Cambria, E., &amp; Mihalcea, R. (2018). Meld: A multimodal multi-party dataset for emotion recognition in conversations. <italic>arXiv preprint arXiv:1810.02508</italic>. </mixed-citation>
      </ref>
      <ref id="ref040">
        <label>[40]</label>
        <mixed-citation> Gratch, J., Artstein, R., Lucas, G. M., Stratou, G., Scherer, S., Nazarian, A., … &amp; Morency, L. P. (2014, May). The distress analysis interview corpus of human and computer interviews. <italic>In LREC</italic> (Vol. 14, pp. 3123-3128). </mixed-citation>
      </ref>
      <ref id="ref041">
        <label>[41]</label>
        <mixed-citation> Cai, H., Yuan, Z., Gao, Y., Sun, S., Li, N., Tian, F., … &amp; Hu, B. (2022). A multi-modal open dataset for mental-disorder analysis. <italic>Scientific Data, 9</italic>(1), 178. [<uri>https://doi.org/10.1038/s41597-022-01211-x</uri>] </mixed-citation>
      </ref>
      <ref id="ref042">
        <label>[42]</label>
        <mixed-citation> Pirina, I., &amp; Çöltekin, Ç. (2018, October). Identifying depression on reddit: The effect of training data. In <italic>Proceedings of the 2018 EMNLP workshop SMM4H: the 3rd social media mining for health applications workshop &amp; shared task</italic> (pp. 9-12). [<uri>https://doi.org/10.18653/v1/W18-5903</uri>] </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>
