<doi_batch xmlns="http://www.crossref.org/schema/4.4.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" version="4.4.0"><head><doi_batch_id>15152e1b-6bf2-4413-b96a-ca1721f333a0</doi_batch_id><timestamp>20251121131230975</timestamp><depositor><depositor_name>wseas:wseas</depositor_name><email_address>mdt@crossref.org</email_address></depositor><registrant>MDT Deposit</registrant></head><body><journal><journal_metadata language="en"><full_title>MOLECULAR SCIENCES AND APPLICATIONS</full_title><issn media_type="electronic">2732-9992</issn><issn media_type="print">2944-9138</issn><archive_locations><archive name="Portico" /></archive_locations><doi_data><doi>10.37394/232023</doi><resource>https://wseas.com/journals/msa/index.php</resource></doi_data></journal_metadata><journal_issue><publication_date media_type="online"><month>4</month><day>23</day><year>2025</year></publication_date><publication_date media_type="print"><month>4</month><day>23</day><year>2025</year></publication_date><journal_volume><volume>5</volume><doi_data><doi>10.37394/232023.2025.5</doi><resource>https://wseas.com/journals/msa/2025.php</resource></doi_data></journal_volume></journal_issue><journal_article language="en"><titles><title>A Mixture Model Based Read Simulating Method for Single Cell RNA Sequencing</title></titles><contributors><person_name sequence="first" contributor_role="author"><given_name>Wenshan</given_name><surname>Li</surname><affiliation>School of Cyber Science and Engineering Sichuan University No. 24 South Section 1, Yihuan Road, Chengdu CHINA</affiliation></person_name><person_name sequence="additional" contributor_role="author"><given_name>Wenbo</given_name><surname>Fang</surname><affiliation>School of Cyber Science and Engineering Sichuan University No. 24 South Section 1, Yihuan Road, Chengdu CHINA</affiliation></person_name><person_name sequence="additional" contributor_role="author"><given_name>Ao</given_name><surname>Liu</surname><affiliation>School of Cyber Science and Engineering Sichuan University No. 24 South Section 1, Yihuan Road, Chengdu CHINA</affiliation></person_name><person_name sequence="additional" contributor_role="author"><given_name>Beibei</given_name><surname>Li</surname><affiliation>School of Cyber Science and Engineering Sichuan University No. 24 South Section 1, Yihuan Road, Chengdu CHINA</affiliation></person_name><person_name sequence="additional" contributor_role="author"><given_name>Junjiang</given_name><surname>He</surname><affiliation>School of Cyber Science and Engineering Sichuan University No. 24 South Section 1, Yihuan Road, Chengdu CHINA</affiliation></person_name><person_name sequence="additional" contributor_role="author"><given_name>Hongxia</given_name><surname>Wang</surname><affiliation>School of Cyber Science and Engineering Sichuan University No. 24 South Section 1, Yihuan Road, Chengdu CHINA</affiliation></person_name></contributors><jats:abstract xmlns:jats="http://www.ncbi.nlm.nih.gov/JATS1"><jats:p>Techniques for single-cell RNA sequencing (scRNA-seq) has enabled unprecedented insights into gene expressions in cell level. Drop-seq is one of the prominent scRNA-seq protocols, and there has been a rapid growth in related analysis tools for Drop-seq data. These methods are tested either using spike-in experiments or on simulation datasets as the real word gene differential expressions are usually unknown. Since spike-in experiments are expensive and time consuming, simulated datasets have become a reasonable alternative method. However, current RNA-seq simulators mostly target at bulk RNA sequencing, which provokes the need of a scRNA-seq simulator for the Drop-seq technology. In this paper, we present a mixture model based read simulating method to simulate the sequencing reads of a Drop-seq experiment. The proposed method is able to simulate large amounts of Drop-seq reads according to the user's experimental setting. Data generated by the proposed model is a reasonable approximation to real Drop-seq data.</jats:p></jats:abstract><publication_date media_type="online"><month>11</month><day>21</day><year>2025</year></publication_date><publication_date media_type="print"><month>11</month><day>21</day><year>2025</year></publication_date><pages><first_page>179</first_page><last_page>189</last_page></pages><publisher_item><item_number item_number_type="article_number">13</item_number></publisher_item><ai:program xmlns:ai="http://www.crossref.org/AccessIndicators.xsd" name="AccessIndicators"><ai:free_to_read start_date="2025-11-21" /><ai:license_ref applies_to="am" start_date="2025-11-21">https://wseas.com/journals/msa/2025/a26msa-012(2025).pdf</ai:license_ref></ai:program><archive_locations><archive name="Portico" /></archive_locations><doi_data><doi>10.37394/232023.2025.5.13</doi><resource>https://wseas.com/journals/msa/2025/a26msa-012(2025).pdf</resource></doi_data><citation_list><citation key="ref0"><doi>10.1038/nmeth.2694</doi><unstructured_citation>Wu, A. R. , Neff, N. F. , Kalisky, T. , Dalerba, P. , Treutlein, B. , &amp; Rothenberg, M. E. , et al. Quantitative assessment of single-cell rnasequencing methods. Nature Methods. </unstructured_citation></citation><citation key="ref1"><doi>10.1016/j.molcel.2015.04.005</doi><unstructured_citation>Kolodziejczyk, A. , Kim, J. K. , Svensson, V. , Marioni, J. , &amp; Teichmann, S. . (2015). The technology and biology of single-cell rna sequencing. Molecular Cell, 58(4), 610-620. </unstructured_citation></citation><citation key="ref2"><doi>10.1038/s12276-018-0071-8</doi><unstructured_citation>Byungjin, H. , Hyun, L. J. , &amp; Duhee, B. . (2018). Single-cell rna sequencing technologies and bioinformatics pipelines. Experimental &amp; Molecular Medicine, 50(8), 96-. </unstructured_citation></citation><citation key="ref3"><doi>10.1038/nbt.2282</doi><unstructured_citation>Ramsköld, D. , Luo, S. , Wang, Y. C. , Li, R. , Deng, Q. , &amp; Faridani, O. R. , et al. (2012). Fulllength mrna-seq from single-cell levels of rna and individual circulating tumor cells. Nature Biotechnology, 30(8), 777-782. </unstructured_citation></citation><citation key="ref4"><doi>10.1101/gr.110882.110</doi><unstructured_citation>Islam, S. , Kjallquist, U. , Moliner, A. , Zajac, P. , Fan, J. B. , &amp; Lonnerberg, P. , et al. (2011). Characterization of the single-cell transcriptional landscape by highly multiplex rna-seq. Genome Research, 21(7), 1160. </unstructured_citation></citation><citation key="ref5"><doi>10.1016/j.celrep.2012.08.003</doi><unstructured_citation>Hashimshony, T. , Wagner, F. , Sher, N. , &amp; Yanai, I. . (2012). Cel-seq: single-cell rna-seq by multiplexed linear amplification. Cell Reports, 2(3), 666-673. </unstructured_citation></citation><citation key="ref6"><doi>10.1016/j.cell.2015.04.044</doi><unstructured_citation>Klein, A. , Mazutis, L. , Akartuna, I. , Tallapragada, N. , Veres, A. , &amp; Li, V. , et al. (2015). Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells. Cell, 161(5), 1187-1201. </unstructured_citation></citation><citation key="ref7"><doi>10.1126/science.1247651</doi><unstructured_citation>Jaitin, D. A. , Kenigsberg, E. , Keren-Shaul, H. , Elefant, N. , Paul, F. , &amp; Zaretsky, I. , et al. (2014). Massively parallel single-cell rna-seq for marker-free decomposition of tissues into cell types. Science, 343(6172), 776-779. </unstructured_citation></citation><citation key="ref8"><doi>10.1101/003236</doi><unstructured_citation>Soumillon, M. , Cacchiarelli, D. , Semrau, S. , Oudenaarden, A. V. , &amp; Mikkelsen, T. . (2014). Characterization of directed differentiation by high-throughput single-cell rna-seq. bioRxiv. </unstructured_citation></citation><citation key="ref9"><doi>10.1016/j.cell.2015.05.002</doi><unstructured_citation>Macosko, E. Z. , Basu, A. , Satija, R. , Nemesh, J. , &amp; Mccarroll, S. A. . (2015). Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell, 161(5), 1202-1214. </unstructured_citation></citation><citation key="ref10"><doi>10.1101/035758</doi><unstructured_citation>Christoph, Ziegenhain, Beate, Vieth, Swati, &amp; Parekh, et al. (2017). Comparative analysis of single-cell rna sequencing methods. Molecular Cell. </unstructured_citation></citation><citation key="ref11"><doi>10.1038/nmeth.2772</doi><unstructured_citation>Islam, S. , Zeisel, A. , Joost, S. , La Manno, G. , Zajac, P. , &amp; Kasper, M. , et al. (2014). Quantitative single-cell rna-seq with unique molecular identifiers. Nature Methods, 11(2), 163-166. </unstructured_citation></citation><citation key="ref12"><doi>10.1007/s00018-014-1637-9</doi><unstructured_citation>De Klerk, E. , Den Dunnen, J. T. , &amp; ‘t Hoen, Peter A. C. (2014). Rna sequencing: from tagbased profiling to resolving complete transcript structure. Cellular &amp; Molecular Life Sciences, 71(18), 3537-3551. </unstructured_citation></citation><citation key="ref13"><doi>10.1093/bioinformatics/btr708</doi><unstructured_citation>Huang, W. , Li, L. , Myers, J. R. , &amp; Marth, G. T. . (2011). Art: a next-generation sequencing read simulator. Bioinformatics. </unstructured_citation></citation><citation key="ref14"><doi>10.1093/bioinformatics/btq365</doi><unstructured_citation>S., Balzer, K., Malde, A., &amp; Lanzen, et al. (2010). Characteristics of 454 pyrosequencing data--enabling realistic simulation with flowsim. Bioinformatics, 26(18), i420-i425. </unstructured_citation></citation><citation key="ref15"><doi>10.1093/nar/gks251</doi><unstructured_citation>Angly, F. E. , Dana, W. , Forest, R. , Philip, H. , &amp; Tyson, G. W. . (2012). Grinder: a versatile amplicon and shotgun sequence simulator. Nucleic Acids Research, 40(12), e94. </unstructured_citation></citation><citation key="ref16"><doi>10.1186/1756-0500-7-533</doi><unstructured_citation>Shcherbina, &amp; Anna. (2014). Fastqsim: platform-independent data characterization and in silico read generation for ngs datasets. BMC Research Notes, 7(1), 1-12. </unstructured_citation></citation><citation key="ref17"><doi>10.1093/bioinformatics/btv272</doi><unstructured_citation>Frazee, A. C. , Jaffe, A. E. , Ben, L. , &amp; Leek, J. T. . (2015). Polyester: simulating rna-seq datasets with differential transcript expression. Bioinformatics(17), 2778-84. </unstructured_citation></citation><citation key="ref18"><doi>10.1038/nrg.2016.57</doi><unstructured_citation>Escalona, M. , Rocha, S. , &amp; Posada, D. . A comparison of tools for the simulation of genomic next-generation sequencing data. Nature Reviews Genetics. </unstructured_citation></citation><citation key="ref19"><doi>10.1186/s13059-017-1305-0</doi><unstructured_citation>Zappia, L. , Phipson, B. , &amp; Oshlack, A. . (2017). Splatter: simulation of single-cell rna sequencing data. Genome Biology, 18(1), 174. </unstructured_citation></citation><citation key="ref20"><doi>10.1186/s13059-016-0947-7</doi><unstructured_citation>Lun, A. T. L. , Bach, K. , &amp; Marioni, J. C. . (2016). Pooling across cells to normalize singlecell rna sequencing data with many zero counts. Genome biology, 17(1), 75. </unstructured_citation></citation><citation key="ref21"><doi>10.1093/biostatistics/kxw055</doi><unstructured_citation>Lun, A. T. L. , &amp; Marioni, J. C. . (2017). Overcoming confounding plate effects in differential expression analyses of single-cell rna-seq data. Biostatistics, 18(3), 451-464. </unstructured_citation></citation><citation key="ref22"><doi>10.1371/journal.pcbi.1004333</doi><unstructured_citation>Vallejos, C. A. , Marioni, J. C. , Sylvia, R. , &amp; Quaid, M. . (2015). Basics: bayesian analysis of single-cell sequencing data. PLoS Computational Biology, 11(6), e1004333. </unstructured_citation></citation><citation key="ref23"><doi>10.1038/s41598-017-12989-x</doi><unstructured_citation>Rizzetto, S. , Eltahla, A. A. , Lin, P. , Bull, R. , Lloyd, A. R. , &amp; Ho, J. W. K. , et al. (2017). Impact of sequencing depth and read length on single cell rna sequencing data of t cells. Scientific Reports. </unstructured_citation></citation><citation key="ref24"><doi>10.1093/bioinformatics/bts635</doi><unstructured_citation>Dobin, A. , Davis, C. A. , Schlesinger, F. , Drenkow, J. , &amp; Gingeras, T. R. . (2012). Star: ultrafast universal rna-seq aligner. Bioinformatics, 29(1). </unstructured_citation></citation><citation key="ref25"><doi>10.1093/bioinformatics/btp692</doi><unstructured_citation>Dewey, C. N. . (2010). Rna-seq gene expression estimation with read mapping uncertainty. Bioinformatics, 26(4), 493-500. </unstructured_citation></citation><citation key="ref26"><doi>10.1186/s13059-018-1496-z</doi><unstructured_citation>ángeles, Arzalluz-Luque, Ana, &amp; Conesa. (2018). Single-cell rnaseq for the study of isoforms-how is that possible?. Genome biology.</unstructured_citation></citation></citation_list></journal_article></journal></body></doi_batch>