Quantitative Biology
See recent articles
Showing new listings for Tuesday, 31 December 2024
- [1] arXiv:2412.19812 [pdf, html, other]
-
Title: Pharmacophore-constrained de novo drug design with diffusion bridgeComments: 13 pages, 6 figures, 4 tablesSubjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG)
De novo design of bioactive drug molecules with potential to treat desired biological targets is a profound task in the drug discovery process. Existing approaches tend to leverage the pocket structure of the target protein to condition the molecule generation. However, even the pocket area of the target protein may contain redundant information since not all atoms in the pocket is responsible for the interaction with the ligand. In this work, we propose PP2Drug - a phamacophore-constrained de novo design approach to generate drug candidate with desired bioactivity. Our method adapts diffusion bridge to effectively convert pharmacophore designs in the spatial space into molecular structures under the manner of equivariant transformation, which provides sophisticated control over optimal biochemical feature arrangement on the generated molecules. PP2Drug is demonstrated to generate hit candidates that exhibit high binding affinity with potential protein targets.
- [2] arXiv:2412.19814 [pdf, other]
-
Title: Predicting Human Brain States with TransformerComments: 11 pages, 4 figures, MICCAI MMMI workshop in pressSubjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
The human brain is a complex and highly dynamic system, and our current knowledge of its functional mechanism is still very limited. Fortunately, with functional magnetic resonance imaging (fMRI), we can observe blood oxygen level-dependent (BOLD) changes, reflecting neural activity, to infer brain states and dynamics. In this paper, we ask the question of whether the brain states rep-resented by the regional brain fMRI can be predicted. Due to the success of self-attention and the transformer architecture in sequential auto-regression problems (e.g., language modelling or music generation), we explore the possi-bility of the use of transformers to predict human brain resting states based on the large-scale high-quality fMRI data from the human connectome project (HCP). Current results have shown that our model can accurately predict the brain states up to 5.04s with the previous 21.6s. Furthermore, even though the prediction error accumulates for the prediction of a longer time period, the gen-erated fMRI brain states reflect the architecture of functional connectome. These promising initial results demonstrate the possibility of developing gen-erative models for fMRI data using self-attention that learns the functional or-ganization of the human brain. Our code is available at: this https URL.
- [3] arXiv:2412.19815 [pdf, html, other]
-
Title: Enhancing Drug-Target Interaction Prediction through Transfer Learning from Activity Cliff Prediction TasksSubjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG)
Recently, machine learning (ML) has gained popularity in the early stages of drug discovery. This trend is unsurprising given the increasing volume of relevant experimental data and the continuous improvement of ML algorithms. However, conventional models, which rely on the principle of molecular similarity, often fail to capture the complexities of chemical interactions, particularly those involving activity cliffs (ACs) - compounds that are structurally similar but exhibit evidently different activity behaviors. In this work, we address two distinct yet related tasks: (1) activity cliff (AC) prediction and (2) drug-target interaction (DTI) prediction. Leveraging insights gained from the AC prediction task, we aim to improve the performance of DTI prediction through transfer learning. A universal model was developed for AC prediction, capable of identifying activity cliffs across diverse targets. Insights from this model were then incorporated into DTI prediction, enabling better handling of challenging cases involving ACs while maintaining similar overall performance. This approach establishes a strong foundation for integrating AC awareness into predictive models for drug discovery. Scientific Contribution This study presents a novel approach that applies transfer learning from AC prediction to enhance DTI prediction, addressing limitations of traditional similarity-based models. By introducing AC-awareness, we improve DTI model performance in structurally complex regions, demonstrating the benefits of integrating compound-specific and protein-contextual information. Unlike previous studies, which treat AC and DTI predictions as separate problems, this work establishes a unified framework to address both data scarcity and prediction challenges in drug discovery.
- [4] arXiv:2412.19831 [pdf, html, other]
-
Title: Leslie Population Models in Predator-prey and Competitive populations: theory and applications by machine learningSubjects: Populations and Evolution (q-bio.PE)
We introduce a new predator-prey model by replacing the growth and predation constant by a square matrix, and the population density as a population vector. The classical Lotka-Volterra model describes a population that either modulates or converges. Stability analysis of such models have been extensively studied by the works of Merdan (this https URL). The new model adds complexity by introducing an age group structure where the population of each age group evolves as prescribed by the Leslie matrix.
The added complexity changes the behavior of the model such that the population either displays roughly an exponential growth or decay. We first provide an exact equation that describes a time evolution and use analytic techniques to obtain an approximate growth factor. We also discuss the variants of the Leslie model, i.e., the complex value predator-prey model and the competitive model. We then prove the Last Species Standing theorem that determines the dominant population in the large time limit.
The recursive structure of the model denies the application of simple regression. We discuss a machine learning scheme that allows an admissible fit for the population evolution of Paramecium Aurelia and Paramecium Caudatum. Another potential avenue to simplify the computation is to use the machinery of quantum operators. We demonstrate the potential of this approach by computing the Hamiltonian of a simple Leslie system. - [5] arXiv:2412.19845 [pdf, other]
-
Title: Unveiling Secrets of Brain Function With Generative Modeling: Motion Perception in Primates & Cortical Network Organization in MiceComments: This is my PhD Dissertation, defended on November 3, 2023Subjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI)
This Dissertation is comprised of two main projects, addressing questions in neuroscience through applications of generative modeling.
Project #1 (Chapter 4) explores how neurons encode features of the external world. I combine Helmholtz's "Perception as Unconscious Inference" -- paralleled by modern generative models like variational autoencoders (VAE) -- with the hierarchical structure of the visual cortex. This combination leads to the development of a hierarchical VAE model, which I test for its ability to mimic neurons from the primate visual cortex in response to motion stimuli. Results show that the hierarchical VAE perceives motion similar to the primate brain. Additionally, the model identifies causal factors of retinal motion inputs, such as object- and self-motion, in a completely unsupervised manner. Collectively, these results suggest that hierarchical inference underlines the brain's understanding of the world, and hierarchical VAEs can effectively model this understanding.
Project #2 (Chapter 5) investigates the spatiotemporal structure of spontaneous brain activity and its reflection of brain states like rest. Using simultaneous fMRI and wide-field Ca2+ imaging data, this project demonstrates that the mouse cortex can be decomposed into overlapping communities, with around half of the cortical regions belonging to multiple communities. Comparisons reveal similarities and differences between networks inferred from fMRI and Ca2+ signals.
The introduction (Chapter 1) is divided similarly to this abstract: sections 1.1 to 1.8 provide background information about Project #1, and sections 1.9 to 1.13 are related to Project #2. Chapter 2 includes historical background, Chapter 3 provides the necessary mathematical background, and finally, Chapter 6 contains concluding remarks and future directions. - [6] arXiv:2412.19915 [pdf, other]
-
Title: Identifying Cocoa Pollinators: A Deep Learning DatasetWenxiu Xu, Saba Ghorbani Bazegar, Dong Sheng, Manuel Toledo-Hernandez, ZhenZhong Lan, Thomas Cherico WangerComments: The manuscript introduces the first cocoa pollination dataset and an example analysis with YOLOv8 modelsSubjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI)
Cocoa is a multi-billion-dollar industry but research on improving yields through pollination remains limited. New embedded hardware and AI-based data analysis is advancing information on cocoa flower visitors, their identity and implications for yields. We present the first cocoa flower visitor dataset containing 5,792 images of Ceratopogonidae, Formicidae, Aphididae, Araneae, and Encyrtidae, and 1,082 background cocoa flower images. This dataset was curated from 23 million images collected over two years by embedded cameras in cocoa plantations in Hainan province, China. We exemplify the use of the dataset with different sizes of YOLOv8 models and by progressively increasing the background image ratio in the training set to identify the best-performing model. The medium-sized YOLOv8 model achieved the best results with 8% background images (F1 Score of 0.71, mAP50 of 0.70). Overall, this dataset is useful to compare the performance of deep learning model architectures on images with low contrast images and difficult detection targets. The data can support future efforts to advance sustainable cocoa production through pollination monitoring projects.
- [7] arXiv:2412.19930 [pdf, html, other]
-
Title: On the Estimation of the Time-Dependent Transmission Rate in Epidemiological ModelsComments: 35 pages, 6 figures (with subfigures)Subjects: Populations and Evolution (q-bio.PE); Numerical Analysis (math.NA)
The COVID-19 pandemic highlighted the need to improve the modeling, estimation, and prediction of how infectious diseases spread. SEIR-like models have been particularly successful in providing accurate short-term predictions. This study fills a notable literature gap by exploring the following question: Is it possible to incorporate a nonparametric susceptible-exposed-infected-removed (SEIR) COVID-19 model into the inverse-problem regularization framework when the transmission coefficient varies over time? Our positive response considers varying degrees of disease severity, vaccination, and other time-dependent parameters. In addition, we demonstrate the continuity, differentiability, and injectivity of the operator that link the transmission parameter to the observed infection numbers. By employing Tikhonov-type regularization to the corresponding inverse problem, we establish the existence and stability of regularized solutions. Numerical examples using both synthetic and real data illustrate the model's estimation accuracy and its ability to fit the data effectively.
- [8] arXiv:2412.19951 [pdf, other]
-
Title: Validation of Subject-Specific Knee Models from In Vivo MeasurementsThor E. Andreassen, Donald R. Hume, Landon D. Hamilton, Stormy L. Hegg, Sean E. Higinbotham, Kevin B. ShelburneComments: 34 pages, 7 tables, 8 figuresSubjects: Quantitative Methods (q-bio.QM); Optimization and Control (math.OC)
Calibration to experimental data is vital when developing subject-specific models towards developing digital twins. Yet, to date, subject-specific models are largely based on cadaveric testing, as in vivo data to calibrate against has been difficult to obtain until recently. To support our overall goal of building subject-specific models of the living knee, we aimed to show that subject-specific computational models built and calibrated using in vivo measurements would have accuracy comparable to models built using in vitro measurements. Two knee specimens were imaged using a combination of computed tomography (CT), and surface scans. Knee laxity measurements were made with a custom apparatus used for the living knee and from a robotic knee simulator. Models of the knees were built using the CT geometry and surface scans, and then calibrated with either laxity data from the robotic knee simulator or from the knee laxity apparatus. Model performance was compared by simulation of passive flexion, knee laxity and a clinically relevant pivot shift. Performance was similar with differences during simulated anterior-posterior laxity tests of less than 2.5 mm. Additionally, model predictions of a pivot shift were similar with differences less than 3 deg or 3 mm for rotations and translations, respectively. Still, differences in the predicted ligament loads and calibrated material properties emerged, highlighting a need for methods to include ligament load as part of the underlying calibration process. Overall, the results showed that currently available methods of measuring knee laxity in vivo are sufficient to calibrate models comparable with existing in vitro techniques, and the workflows described here may provide a basis for modeling the living knee. The models, data, and code are publicly available.
- [9] arXiv:2412.20011 [pdf, other]
-
Title: AI Model for Predicting Binding Affinity of Antidiabetic Compounds Targeting PPARComments: Contain graphical abstract, 1 figure, and 3 tablesSubjects: Biomolecules (q-bio.BM)
This study aims to develop a deep learning model for predicting the binding affinity of ligands targeting the Peroxisome Proliferator-Activated Receptor (PPAR) family, using 2D molecular descriptors. A dataset of 3,764 small molecules with known binding affinities, sourced from the ChEMBL database, was preprocessed by eliminating duplicates and incomplete data. Molecular docking simulations using AutoDock Vina were performed to predict binding affinities for the PPAR receptor family. 2D molecular descriptors were computed from the SMILES notation of each ligand, capturing essential structural and physicochemical features. These descriptors, along with the predicted binding affinities, were used to train a deep learning model to predict binding affinity as a regression task. The model was evaluated using metrics such as Mean Squared Error (MSE), Mean Absolute Error (MAE), and R-squared. Results indicated strong performance with an R squared value of 0.861 for the training set and 0.655 for the test set, suggesting good model generalization. The model shows promise for predicting ligand-receptor interactions and can be applied in drug discovery efforts targeting PPAR-related diseases.
- [10] arXiv:2412.20038 [pdf, other]
-
Title: BioTD: an online database of biotoxinsGaoang Wang, Hang Wu, Yang Liao, Zhen Chen, Qing Zhou, Wenxing Wang, Yifei Liu, Yilin Wang, Meijing Wu, Ruiqi Xiang, Yuntao Yu, Xi Zhou, Feng Zhu, Zhonghua Liu, Tingjun HouSubjects: Biomolecules (q-bio.BM)
Biotoxins, mainly produced by venomous animals, plants and microorganisms, exhibit high physiological activity and unique effects such as lowering blood pressure and analgesia. A number of venom-derived drugs are already available on the market, with many more candidates currently undergoing clinical and laboratory studies. However, drug design resources related to biotoxins are insufficient, particularly a lack of accurate and extensive activity data. To fulfill this demand, we develop the Biotoxins Database (BioTD). BioTD is the largest open-source database for toxins, offering open access to 14,607 data records (8,185 activity records), covering 8,975 toxins sourced from 5,220 references and patents across over 900 species. The activity data in BioTD is categorized into five groups: Activity, Safety, Kinetics, Hemolysis and other physiological indicators. Moreover, BioTD provides data on 986 mutants, refines the whole sequence and signal peptide sequences of toxins, and annotates disulfide bond information. Given the importance of biotoxins and their associated data, this new database was expected to attract broad interests from diverse research fields in drug discovery. BioTD is freely accessible at this http URL.
- [11] arXiv:2412.20076 [pdf, other]
-
Title: Tractable size-structured fish growth models in natural environment with an application to an inland fishSubjects: Populations and Evolution (q-bio.PE); Probability (math.PR)
Modeling fish growth is an important research topic in ecological and fishery sciences because body weight statistics directly affect the total biomass of fish in a habitat, which in turn affects their population dynamics. Many models of fish growth assume that the fish population in a habitat is homogenous, meaning that there is no physiological spectrum and, therefore, no size spectrum. Moreover, models that account for the size spectrum are not always analytically tractable. We present novel mathematical models of fish growth in which the body weight of each fish is assumed to follow a von Bertalanffy-type model whose proportionality coefficient, representing the maximum body weight, may differ among individual fish. This probabilistic description introduces the size spectrum into the model, owing to which the time-dependent probability density of this model is obtained explicitly. We also consider a misspecified version and a stochastic version of the model as advanced cases. We apply the first model to the real growth data of Plecoglossus altivelis altivelis as a keystone fish species in Japan. The model successfully reproduces the skewed size spectrum of this fish species over multiple years. We further use the stochastic model to investigate how fish growth dynamics are affected by environmental fluctuations.
- [12] arXiv:2412.20113 [pdf, html, other]
-
Title: Intrinsic noise in the compartment model of time delays in evolutionary gamesComments: 12 pages, 8 figuresSubjects: Populations and Evolution (q-bio.PE)
We study the effects of strategy-dependent time delays in deterministic and stochastic compartment models of the Snowdrift game. In replicator dynamics with two compartments, adults and kindergarten, augmented by death rates, stationary states of population sizes and strategy frequencies depend continuously on time delays represented by transition rates between compartments. In the corresponding birth-death Markov jump processes we observe the novel behavior, time delays are beneficial for the cooperation strategy.
- [13] arXiv:2412.20202 [pdf, html, other]
-
Title: Revealing the Shape of Genome Space via K-mer TopologyComments: 51 pages, 16 figuresSubjects: Genomics (q-bio.GN); Algebraic Topology (math.AT)
Despite decades of effort, understanding the shape of genome space in biology remains a challenge due to the similarity, variability, diversity, and plasticity of evolutionary relationships among species, genes, or other biological entities. We present a k-mer topology method, the first of its kind, to delineate the shape of the genome space. K-mer topology examines the topological persistence and the evolution of the homotopic shape of the sequences of k nucleotides in species, organisms, and genes using persistent Laplacians, a new multiscale combinatorial approach. We also propose a topological genetic distance between species by their topological invariants and non-harmonic spectra over scales. This new metric defines the topological phylogenetic trees of genomes, facilitating species classification and clustering. K-mer topology substantially outperforms state-of-the-art methods on a variety of benchmark datasets, including mammalian mitochondrial genomes, Rhinovirus, SARS-CoV-2 variants, Ebola virus, Hepatitis E virus, Influenza hemagglutinin genes, and whole bacterial genomes. K-mer topology reveals the intrinsic shapes of the genome space and can be directly applied to the rational design of viral vaccines.
- [14] arXiv:2412.20245 [pdf, other]
-
Title: Machine-Learning Enabled Multidimensional Data Utilization in Multi-resonance Biosensors: A Pathway to Enhanced AccuracyComments: 31 pages total. References are before supplementary information at page 19. Supplementary information are placed after references at the end of the manuscriptSubjects: Quantitative Methods (q-bio.QM); Signal Processing (eess.SP)
A novel framework is proposed that combines multi-resonance biosensors with machine learning (ML) to significantly enhance the accuracy of parameter prediction in biosensing. Unlike traditional single-resonance systems, which are limited to one-dimensional datasets, this approach leverages multi-dimensional data generated by a custom-designed nanostructure, a periodic array of silicon nanorods with a triangular cross-section over an aluminum reflector. High bulk sensitivity values are achieved for this multi-resonant structure, with certain resonant peaks reaching up to 1706 nm/RIU. The predictive power of multiple resonant peaks from transverse magnetic (TM) and transverse electric (TE) polarizations is evaluated using Ridge Regression modeling. Systematic analysis reveals that incorporating multiple resonances yields up to three orders of magnitude improvement in refractive index detection precision compared to single-peak analyses. This precision enhancement is achieved without modifications to the biosensor hardware, highlighting the potential of data-centric strategies in biosensing. The findings establish a new paradigm in biosensing, demonstrating that the synergy between multi-resonance data acquisition and ML-based analysis can significantly enhance detection accuracy. This study provides a scalable pathway for advancing high-precision biosensing technologies.
- [15] arXiv:2412.20445 [pdf, html, other]
-
Title: Molecular Communication-Based Quorum Sensing Disruption for Enhanced Immune DefenseSubjects: Biomolecules (q-bio.BM)
Molecular Communication (MC) utilizes chemical molecules to transmit information, introducing innovative strategies for pharmaceutical interventions and enhanced immune system monitoring. This paper explores Molecular communication based approach to disrupt Quorum Sensing (QS) pathways to bolster immune defenses against antimicrobial-resistant bacteria. Quorum Sensing enables bacteria to coordinate critical behaviors, including virulence and antibiotic resistance, by exchanging chemical signals, known as autoinducers. By interfering with this bacterial communication, we can disrupt the synchronization of activities that promote infection and resistance. The study focuses on RNAIII inhibiting peptide (RIP), which blocks the production of critical transcripts, RNAII and RNAIII, within the Accessory Gene Regulator (AGR) system, thereby weakening bacterial virulence and enhancing host immune responses. The synergistic effects of combining QS inhibitors like RIP with traditional antimicrobial treatments reduce the need for highdose antibiotics, offering a potential solution to antibiotic resistance. This molecular communication-based approach presents a promising path to improved treatment efficacy and more robust immune responses against bacterial infections by targeting bacterial communication.
- [16] arXiv:2412.20550 [pdf, html, other]
-
Title: Machine learning discoveries of ASCL2-X synergy in ETC-1922159 treated colorectal cancer cellsSubjects: Molecular Networks (q-bio.MN)
Achaete-scute complex homolog 2 (ASCL2) codes a part of the basic helix-loop-helix (BHLH) transcription factor family. WNTs have been found to directly affect the stemness of the tumor cells via regulation of ASCL2. Switching off the ASCL2 literally blocks the stemness process of the tumor cells and vice versa. In colorectal cancer (CRC) cells treated with ETC-1922159, ASCL2 was found to be down regulated along with other genes. A recently developed search engine ranked combinations of ASCL2-X (X, a particular gene/protein) at 2nd order level after drug administration. Some rankings confirm the already tested combinations, while others point to those that are untested/unexplored. These rankings reveal which ASCL2-X combinations might be working synergistically in CRC. In this research work, I cover combinations of ASCL2 with WNT, transforming growth factor beta (TGFB), interleukin (IL), leucine rich repeat containing G protein-coupled receptor (LGR), NOTCH, solute carrier family (SLC), SRY-box transcription factor (SOX), small nucleolar RNA host gene (SNHG), KIAA, F-box protein (FBXO), family with sequence similarity (FAM), B cell CLL/lymphoma (BCL), autophagy related (ATG) and Rho GTPase activating protein (ARHGAP) family.
- [17] arXiv:2412.20568 [pdf, html, other]
-
Title: Derivations of Animal Movement Models with Explicit MemorySubjects: Populations and Evolution (q-bio.PE); Quantitative Methods (q-bio.QM)
Highly evolved animals continuously update their knowledge of social factors, refining movement decisions based on both historical and real-time observations. Despite its significance, research on the underlying mechanisms remains limited. In this study, we explore how the use of explicit memory shapes different mathematical models across various ecological dispersal scenarios. Specifically, we investigate three memory-based dispersal scenarios: gradient-based movement, where individuals respond to environmental gradients; environment matching, which promotes uniform distribution within a population; and location-based movement, where decisions rely solely on local suitability. These scenarios correspond to diffusion advection, Fickian diffusion, and Fokker-Planck diffusion models, respectively. We focus on the derivation of these memory-based movement models using three approaches: spatial and temporal discretization, patch models in continuous time, and discrete-velocity jump process. These derivations highlight how different ways of using memory lead to distinct mathematical models. Numerical simulations reveal that the three dispersal scenarios exhibit distinct behaviors under memory-induced repulsive and attractive conditions. The diffusion advection and Fokker-Planck models display wiggle patterns and aggregation phenomena, while simulations of the Fickian diffusion model consistently stabilize to uniform constant states.
- [18] arXiv:2412.20873 [pdf, other]
-
Title: Neurophenomenal Structuralism and the Role of Computational ContextComments: 29 pages, 7 figures, submission for the "Structuralism in Consciousness Studies" @ Berlin 2023Subjects: Neurons and Cognition (q-bio.NC)
Neurophenomenal structuralism posits that conscious experiences are defined relationally and that their phenomenal structures are mirrored by neural structures. While this approach offers a promising framework for identifying neural correlates of contents of consciousness (NCCCs), we argue that merely establishing structural correspondences between neural and phenomenal structures is insufficient. This paper emphasizes the critical role of computational context in determining the content of neural structures. We introduce four criteria - Sensitivity, Organization, Exploitation, and Contextualization - to evaluate which neural structures are viable NCCC candidates. These criteria highlight that, for neural structures to meaningfully mirror phenomenal structures they have to be actively exploited and be able to influence behavior in a structure-preserving way. Our analysis demonstrates that anatomical and causal neural structures fail to meet certain criteria, whereas activation structures can, provided they are embedded within the appropriate computational context. Our findings challenge both local and rich global structuralist theories for overlooking the content-constituting role of computational context, leading to proposed NCCCs that fail to fully account for conscious content. We conclude that incorporating computational context is essential for any structuralist account of consciousness, as it determines the nature of dimensions within neural activation spaces and, consequently, the content of conscious experiences.
- [19] arXiv:2412.20933 [pdf, html, other]
-
Title: ProtScan: Modeling and Prediction of RNA-Protein InteractionsComments: 19 pages, 4 figuresSubjects: Biomolecules (q-bio.BM)
CLIP-seq methods are valuable techniques to experimentally determine transcriptome-wide binding sites of RNA-binding proteins. Despite the constant improvement of such techniques (e.g. eCLIP), the results are affected by various types of noise and depend on experimental conditions such as cell line, tissue, gene expression levels, stress conditions etc., paving the way for the in silico modeling of RNA-protein interactions. Here we present ProtScan, a predictive tool based on consensus kernelized SGD regression. ProtScan denoises and generalizes the information contained in CLIP-seq experiments. It outperforms competitor state-of the-art methods and can be used to model RNA-protein interactions on a transcriptome-wide scale.
- [20] arXiv:2412.21025 [pdf, html, other]
-
Title: Considering experimental frame rates and robust segmentation analysis of piecewise-linear microparticle trajectoriesSubjects: Quantitative Methods (q-bio.QM)
The movement of intracellular cargo transported by molecular motors is commonly marked by switches between directed motion and stationary pauses. The predominant measure for assessing movement is effective diffusivity, which predicts the mean-squared displacement of particles over long time scales. In this work, we consider an alternative analysis regime that focuses on shorter time scales and relies on automated segmentation of paths. Due to intrinsic uncertainty in changepoint analysis, we highlight the importance of statistical summaries that are robust with respect to the performance of segmentation algorithms. In contrast to effective diffusivity, which averages over multiple behaviors, we emphasize tools that highlight the different motor-cargo states, with an eye toward identifying biophysical mechanisms that determine emergent whole-cell transport properties. By developing a Markov chain model for noisy, continuous, piecewise-linear microparticle movement, and associated mathematical analysis, we provide insight into a common question posed by experimentalists: how does the choice of observational frame rate affect what is inferred about transport properties?
- [21] arXiv:2412.21076 [pdf, html, other]
-
Title: Pharmacometrics Modeling via Physics-Informed Neural Networks: Integrating Time-Variant Absorption Rates and Fractional Calculus for Enhancing Prediction AccuracyJournal-ref: CMBE 2024 Proceedings Vol. 2Subjects: Quantitative Methods (q-bio.QM)
We present a novel method to improve pharmacokinetics modeling, an essential step of drug development. Conventional models frequently fail to fully represent the intricacies of drug absorption and distribution, which limits their predictive abilities required for personalized treatment strategies. Our methodology introduces two innovations to enhance modeling accuracy: 1. Time-varying parameters: this approach is designed to accommodate the dynamic nature of drug absorption rates. 2. Fractional calculus in representing delayed drug response. This approach effectively captures anomalous diffusion phenomena, surpassing traditional models in describing drug delayed response without the need for extensive compartmentalization.
- [22] arXiv:2412.21081 [pdf, html, other]
-
Title: The FlEye camera: Sampling the joint distribution of natural scenes and motionSubjects: Neurons and Cognition (q-bio.NC); Disordered Systems and Neural Networks (cond-mat.dis-nn)
To make efficient use of limited physical resources, the brain must match its coding and computational strategies to the statistical structure of input signals. An attractive testing ground for these principles is the problem of motion estimation in the fly visual system: we understand the optics of the compound eye, have a quantitative description of input signals and noise from the retina, and can record from output neurons that encode estimates of different velocity components. Furthermore, recent work provides a nearly complete wiring diagram of the intervening circuitry. What is missing is a characterization of the visual signals and motions that flies encounter in a natural context. We attack this directly with the development of a specialized camera that matches the high temporal resolution, optical properties, and spectral sensitivity of the fly's eye; inertial motion sensors provide ground truth about rotations and translations through the world. We describe the design, construction, and performance characteristics of this FlEye camera. To illustrate the opportunities created by this instrument we use data on movies and motion to construct optimal local motion estimators that can be compared with the responses of the fly's motion sensitive neurons.
- [23] arXiv:2412.21111 [pdf, html, other]
-
Title: Intrinsic meaning, perception, and matchingComments: 47 pages, 8 figuresSubjects: Neurons and Cognition (q-bio.NC)
Integrated information theory (IIT) argues that the substrate of consciousness is a maximally irreducible complex of units. Together, subsets of the complex specify a cause-effect structure, composed of distinctions and their relations, which accounts in full for the quality of experience. The feeling of a specific experience is also its meaning for the subject, which is thus defined intrinsically, regardless of whether the experience occurs in a dream or is triggered by processes in the environment. Here we extend IIT's framework to characterize the relationship between intrinsic meaning, extrinsic stimuli, and causal processes in the environment, illustrated using a simple model of a sensory hierarchy. We argue that perception should be considered as a structured interpretation, where a stimulus from the environment acts merely as a trigger a system's state and the structure is provided by the complex's intrinsic connectivity. We also propose that perceptual differentiation -- the richness and diversity of structures triggered by representative sequences of stimuli -- quantifies the meaningfulness of different environments to a complex. In adaptive systems, this reflects the "matching" between intrinsic meanings and causal processes in an environment.
- [24] arXiv:2412.21159 [pdf, other]
-
Title: A Standardized Framework for Sensor Placement in Human Motion Capture and Wearable ApplicationsComments: 7 pages, 1 Table with a Figure inside. GitHub Rpostiroy and Page are available from the code availability sectionSubjects: Quantitative Methods (q-bio.QM)
The proliferation of wearable sensors and monitoring technologies has created an urgent need for standardized sensor placement protocols. While existing standards like SENIAM address specific applications, no comprehensive framework spans different sensing modalities and applications. We present a unified sensor placement standard that ensures the reproducibility and transferability of human movement and physiological data across various systems and research domains. Our framework provides precise anatomical landmarks, coordinate systems, and placement protocols with defined precision levels, compatible with existing data-sharing standards such as the Brain Imaging Data Structure (BIDS) and Heirechciacal Event Descriptors (HED). This framework aims to enhance data quality, reproducibility, and interoperability in applications ranging from lab-based clinical biomechanics to continuous health monitoring in everyday life.
- [25] arXiv:2412.21178 [pdf, html, other]
-
Title: Two-component spatiotemporal template for activation-inhibition of speech in ECoGSubjects: Neurons and Cognition (q-bio.NC); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
I compute the average trial-by-trial power of band-limited speech activity across epochs of multi-channel high-density electrocorticography (ECoG) recorded from multiple subjects during a consonant-vowel speaking task. I show that previously seen anti-correlations of average beta frequency activity (12-35 Hz) to high-frequency gamma activity (70-140 Hz) during speech movement are observable between individual ECoG channels in the sensorimotor cortex (SMC). With this I fit a variance-based model using principal component analysis to the band-powers of individual channels of session-averaged ECoG data in the SMC and project SMC channels onto their lower-dimensional principal components.
Spatiotemporal relationships between speech-related activity and principal components are identified by correlating the principal components of both frequency bands to individual ECoG channels over time using windowed correlation. Correlations of principal component areas to sensorimotor areas reveal a distinct two-component activation-inhibition-like representation for speech that resembles distinct local sensorimotor areas recently shown to have complex interplay in whole-body motor control, inhibition, and posture. Notably the third principal component shows insignificant correlations across all subjects, suggesting two components of ECoG are sufficient to represent SMC activity during speech movement. - [26] arXiv:2412.21188 [pdf, html, other]
-
Title: Sparse chaos in cortical circuitsSubjects: Neurons and Cognition (q-bio.NC); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG); Chaotic Dynamics (nlin.CD)
Nerve impulses, the currency of information flow in the brain, are generated by an instability of the neuronal membrane potential dynamics. Neuronal circuits exhibit collective chaos that appears essential for learning, memory, sensory processing, and motor control. However, the factors controlling the nature and intensity of collective chaos in neuronal circuits are not well understood. Here we use computational ergodic theory to demonstrate that basic features of nerve impulse generation profoundly affect collective chaos in neuronal circuits. Numerically exact calculations of Lyapunov spectra, Kolmogorov-Sinai-entropy, and upper and lower bounds on attractor dimension show that changes in nerve impulse generation in individual neurons moderately impact information encoding rates but qualitatively transform phase space structure. Specifically, we find a drastic reduction in the number of unstable manifolds, Kolmogorov-Sinai entropy, and attractor dimension. Beyond a critical point, marked by the simultaneous breakdown of the diffusion approximation, a peak in the largest Lyapunov exponent, and a localization transition of the leading covariant Lyapunov vector, networks exhibit sparse chaos: prolonged periods of near stable dynamics interrupted by short bursts of intense chaos. Analysis of large, more realistically structured networks supports the generality of these findings. In cortical circuits, biophysical properties appear tuned to this regime of sparse chaos. Our results reveal a close link between fundamental aspects of single-neuron biophysics and the collective dynamics of cortical circuits, suggesting that nerve impulse generation mechanisms are adapted to enhance circuit controllability and information flow.
New submissions (showing 26 of 26 entries)
- [27] arXiv:2412.19875 (cross-list from physics.bio-ph) [pdf, other]
-
Title: Biological Insights from Integrative Modeling of Intrinsically Disordered Protein SystemsSubjects: Biological Physics (physics.bio-ph); Biomolecules (q-bio.BM)
Intrinsically disordered proteins and regions are increasingly appreciated for their abundance in the proteome and the many functional roles they play in the cell. In this short review, we describe a variety of approaches used to obtain biological insight from the structural ensembles of disordered proteins, regions, and complexes and the integrative biology challenges that arise from combining diverse experiments and computational models. Importantly, we highlight findings regarding structural and dynamic characterization of disordered regions involved in binding and phase separation, as well as drug targeting of disordered regions, using a broad framework of integrative modeling approaches.
- [28] arXiv:2412.19999 (cross-list from cs.CV) [pdf, html, other]
-
Title: Comprehensive Review of EEG-to-Output Research: Decoding Neural Signals into Images, Videos, and AudioComments: 15 pages. Submitted as a conference paper to IntelliSys 2025Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
Electroencephalography (EEG) is an invaluable tool in neuroscience, offering insights into brain activity with high temporal resolution. Recent advancements in machine learning and generative modeling have catalyzed the application of EEG in reconstructing perceptual experiences, including images, videos, and audio. This paper systematically reviews EEG-to-output research, focusing on state-of-the-art generative methods, evaluation metrics, and data challenges. Using PRISMA guidelines, we analyze 1800 studies and identify key trends, challenges, and opportunities in the field. The findings emphasize the potential of advanced models such as Generative Adversarial Networks (GANs), Variational Autoencoders (VAEs), and Transformers, while highlighting the pressing need for standardized datasets and cross-subject generalization. A roadmap for future research is proposed that aims to improve decoding accuracy and broadening real-world applications.
- [29] arXiv:2412.20012 (cross-list from math.PR) [pdf, html, other]
-
Title: The asymptotic distribution of the $k$-Robinson-Foulds dissimilarity measure on labelled treesComments: 16 pages, 2 figuresSubjects: Probability (math.PR); Combinatorics (math.CO); Populations and Evolution (q-bio.PE)
Motivated by applications in medical bioinformatics, Khayatian et al. (2024) introduced a family of metrics on Cayley trees (the $k$-RF distance, for $k=0, \ldots, n-2$) and explored their distribution on pairs of random Cayley trees via simulations. In this paper, we investigate this distribution mathematically, and derive exact asymptotic descriptions of the distribution of the $k$-RF metric for the extreme values $k=0$ and $k=n-2$, as $n$ becomes large. We show that a linear transform of the $0$-RF metric converges to a Poisson distribution (with mean 2) whereas a similar transform for the $(n-2)$-RF metric leads to a normal distribution (with mean $\sim ne^{-2}$). These results (together with the case $k=1$ which behaves quite differently, and $k=n-3$) shed light on the earlier simulation results, and the predictions made concerning them.
- [30] arXiv:2412.20014 (cross-list from cs.LG) [pdf, html, other]
-
Title: ProtCLIP: Function-Informed Protein Multi-Modal LearningJournal-ref: AAAI 2025Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Biomolecules (q-bio.BM)
Multi-modality pre-training paradigm that aligns protein sequences and biological descriptions has learned general protein representations and achieved promising performance in various downstream applications. However, these works were still unable to replicate the extraordinary success of language-supervised visual foundation models due to the ineffective usage of aligned protein-text paired data and the lack of an effective function-informed pre-training paradigm. To address these issues, this paper curates a large-scale protein-text paired dataset called ProtAnno with a property-driven sampling strategy, and introduces a novel function-informed protein pre-training paradigm. Specifically, the sampling strategy determines selecting probability based on the sample confidence and property coverage, balancing the data quality and data quantity in face of large-scale noisy data. Furthermore, motivated by significance of the protein specific functional mechanism, the proposed paradigm explicitly model protein static and dynamic functional segments by two segment-wise pre-training objectives, injecting fine-grained information in a function-informed manner. Leveraging all these innovations, we develop ProtCLIP, a multi-modality foundation model that comprehensively represents function-aware protein embeddings. On 22 different protein benchmarks within 5 types, including protein functionality classification, mutation effect prediction, cross-modal transformation, semantic similarity inference and protein-protein interaction prediction, our ProtCLIP consistently achieves SOTA performance, with remarkable improvements of 75% on average in five cross-modal transformation benchmarks, 59.9% in GO-CC and 39.7% in GO-BP protein function prediction. The experimental results verify the extraordinary potential of ProtCLIP serving as the protein multi-modality foundation model.
- [31] arXiv:2412.20060 (cross-list from eess.SP) [pdf, other]
-
Title: Self-Calibrated Dual Contrasting for Annotation-Efficient Bacteria Raman Spectroscopy Clustering and ClassificationSubjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
Raman scattering is based on molecular vibration spectroscopy and provides a powerful technology for pathogenic bacteria diagnosis using the unique molecular fingerprint information of a substance. The integration of deep learning technology has significantly improved the efficiency and accuracy of intelligent Raman spectroscopy (RS) recognition. However, the current RS recognition methods based on deep neural networks still require the annotation of a large amount of spectral data, which is labor-intensive. This paper presents a novel annotation-efficient Self-Calibrated Dual Contrasting (SCDC) method for RS recognition that operates effectively with few or no annotation. Our core motivation is to represent the spectrum from two different perspectives in two distinct subspaces: embedding and category. The embedding perspective captures instance-level information, while the category perspective reflects category-level information. Accordingly, we have implemented a dual contrastive learning approach from two perspectives to obtain discriminative representations, which are applicable for Raman spectroscopy recognition under both unsupervised and semi-supervised learning conditions. Furthermore, a self-calibration mechanism is proposed to enhance robustness. Validation of the identification task on three large-scale bacterial Raman spectroscopy datasets demonstrates that our SCDC method achieves robust recognition performance with very few (5$\%$ or 10$\%$) or no annotations, highlighting the potential of the proposed method for biospectral identification in annotation-efficient clinical scenarios.
- [32] arXiv:2412.20292 (cross-list from cs.LG) [pdf, html, other]
-
Title: An analytic theory of creativity in convolutional diffusion modelsSubjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC); Machine Learning (stat.ML)
We obtain the first analytic, interpretable and predictive theory of creativity in convolutional diffusion models. Indeed, score-based diffusion models can generate highly creative images that lie far from their training data. But optimal score-matching theory suggests that these models should only be able to produce memorized training examples. To reconcile this theory-experiment gap, we identify two simple inductive biases, locality and equivariance, that: (1) induce a form of combinatorial creativity by preventing optimal score-matching; (2) result in a fully analytic, completely mechanistically interpretable, equivariant local score (ELS) machine that, (3) without any training can quantitatively predict the outputs of trained convolution only diffusion models (like ResNets and UNets) with high accuracy (median $r^2$ of $0.90, 0.91, 0.94$ on CIFAR10, FashionMNIST, and MNIST). Our ELS machine reveals a locally consistent patch mosaic model of creativity, in which diffusion models create exponentially many novel images by mixing and matching different local training set patches in different image locations. Our theory also partially predicts the outputs of pre-trained self-attention enabled UNets (median $r^2 \sim 0.75$ on CIFAR10), revealing an intriguing role for attention in carving out semantic coherence from local patch mosaics.
- [33] arXiv:2412.20570 (cross-list from cond-mat.soft) [pdf, html, other]
-
Title: Voltage laws in nanodomains revealed by asymptotics and simulations of electro-diffusion equationsComments: 30 pages, 7 figures, 3 tablesSubjects: Soft Condensed Matter (cond-mat.soft); Analysis of PDEs (math.AP); Subcellular Processes (q-bio.SC)
Characterizing the local voltage distribution within nanophysiological domains, driven by ionic currents through membrane channels, is crucial for studying cellular activity in modern biophysics, yet it presents significant experimental and theoretical challenges. Theoretically, the complexity arises from the difficulty of solving electro-diffusion equations in three-dimensional domains. Currently, there are no methods available for obtaining asymptotic computations or approximated solutions of nonlinear equations, and numerically, it is challenging to explore solutions across both small and large spatial scales. In this work, we develop a method to solve the Poisson-Nernst-Planck equations with ionic currents entering and exiting through two narrow, circular window channels located on the boundary. The inflow through the first window is composed of a single cation, while the outflow maintains a constant ionic density satisfying local electro-neutrality conditions. Employing regular expansions and Green's function representations, we derive the ionic profiles and voltage drops in both small and large charge regimes. We explore how local surface curvature and window channels size influence voltage dynamics and validate our theoretical predictions through numerical simulations, assessing the accuracy of our asymptotic computations. These novel relationships between current, voltage, concentrations and geometry can enhance the characterization of physiological behaviors of nanodomains.
- [34] arXiv:2412.20616 (cross-list from cs.LG) [pdf, html, other]
-
Title: Hilbert Curve Based Molecular Sequence AnalysisSubjects: Machine Learning (cs.LG); Other Quantitative Biology (q-bio.OT)
Accurate molecular sequence analysis is a key task in the field of bioinformatics. To apply molecular sequence classification algorithms, we first need to generate the appropriate representations of the sequences. Traditional numeric sequence representation techniques are mostly based on sequence alignment that faces limitations in the form of lack of accuracy. Although several alignment-free techniques have also been introduced, their tabular data form results in low performance when used with Deep Learning (DL) models compared to the competitive performance observed in the case of image-based data. To find a solution to this problem and to make Deep Learning (DL) models function to their maximum potential while capturing the important spatial information in the sequence data, we propose a universal Hibert curve-based Chaos Game Representation (CGR) method. This method is a transformative function that involves a novel Alphabetic index mapping technique used in constructing Hilbert curve-based image representation from molecular sequences. Our method can be globally applied to any type of molecular sequence data. The Hilbert curve-based image representations can be used as input to sophisticated vision DL models for sequence classification. The proposed method shows promising results as it outperforms current state-of-the-art methods by achieving a high accuracy of $94.5$\% and an F1 score of $93.9\%$ when tested with the CNN model on the lung cancer dataset. This approach opens up a new horizon for exploring molecular sequence analysis using image classification methods.
Cross submissions (showing 8 of 8 entries)
- [35] arXiv:2207.01586 (replaced) [pdf, html, other]
-
Title: Accurate RNA 3D structure prediction using a language model-based deep learning approachTao Shen, Zhihang Hu, Siqi Sun, Di Liu, Felix Wong, Jiuming Wang, Jiayang Chen, Yixuan Wang, Liang Hong, Jin Xiao, Liangzhen Zheng, Tejas Krishnamoorthi, Irwin King, Sheng Wang, Peng Yin, James J. Collins, Yu LiSubjects: Quantitative Methods (q-bio.QM); Machine Learning (cs.LG); Biomolecules (q-bio.BM)
Accurate prediction of RNA three-dimensional (3D) structure remains an unsolved challenge. Determining RNA 3D structures is crucial for understanding their functions and informing RNA-targeting drug development and synthetic biology design. The structural flexibility of RNA, which leads to scarcity of experimentally determined data, complicates computational prediction efforts. Here, we present RhoFold+, an RNA language model-based deep learning method that accurately predicts 3D structures of single-chain RNAs from sequences. By integrating an RNA language model pre-trained on ~23.7 million RNA sequences and leveraging techniques to address data scarcity, RhoFold+ offers a fully automated end-to-end pipeline for RNA 3D structure prediction. Retrospective evaluations on RNA-Puzzles and CASP15 natural RNA targets demonstrate RhoFold+'s superiority over existing methods, including human expert groups. Its efficacy and generalizability are further validated through cross-family and cross-type assessments, as well as time-censored benchmarks. Additionally, RhoFold+ predicts RNA secondary structures and inter-helical angles, providing empirically verifiable features that broaden its applicability to RNA structure and function studies.
- [36] arXiv:2307.10246 (replaced) [pdf, html, other]
-
Title: Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey)Subba Reddy Oota, Zijiao Chen, Manish Gupta, Raju S. Bapi, Gael Jobard, Frederic Alexandre, Xavier HinautComments: 61 pages, 22 figuresJournal-ref: Published in Transactions on Machine Learning Research (12/2024)Subjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
Can artificial intelligence unlock the secrets of the human brain? How do the inner mechanisms of deep learning models relate to our neural circuits? Is it possible to enhance AI by tapping into the power of brain recordings? These captivating questions lie at the heart of an emerging field at the intersection of neuroscience and artificial intelligence. Our survey dives into this exciting domain, focusing on human brain recording studies and cutting-edge cognitive neuroscience datasets that capture brain activity during natural language processing, visual perception, and auditory experiences. We explore two fundamental approaches: encoding models, which attempt to generate brain activity patterns from sensory inputs; and decoding models, which aim to reconstruct our thoughts and perceptions from neural signals. These techniques not only promise breakthroughs in neurological diagnostics and brain-computer interfaces but also offer a window into the very nature of cognition. In this survey, we first discuss popular representations of language, vision, and speech stimuli, and present a summary of neuroscience datasets. We then review how the recent advances in deep learning transformed this field, by investigating the popular deep learning based encoding and decoding architectures, noting their benefits and limitations across different sensory modalities. From text to images, speech to videos, we investigate how these models capture the brain's response to our complex, multimodal world. While our primary focus is on human studies, we also highlight the crucial role of animal models in advancing our understanding of neural mechanisms. Throughout, we mention the ethical implications of these powerful technologies, addressing concerns about privacy and cognitive liberty. We conclude with a summary and discussion of future trends in this rapidly evolving field.
- [37] arXiv:2310.02734 (replaced) [pdf, other]
-
Title: Numerical modeling of hydrogel scaffold anisotropy during extrusion-based 3D printing for tissue engineeringSubjects: Tissues and Organs (q-bio.TO); Molecular Networks (q-bio.MN)
Extrusion-based 3D printing is a widely utilized tool in tissue engineering, offering precise 3D control of bioinks to construct organ-sized biomaterial objects with hierarchically organized cellularized scaffolds. The internal organization of scaffold constituents must replicate the structural anisotropy of the targeted tissue to effectively promote cellular behavior during 3D cell culture. The choice of polymers in the bioink and extrusion process topological properties significantly impact tissue engineering constructs' structural anisotropy and cellular response. Our study employed a hydrogel bioink consisting of fibrinogen, alginate, and gelatin, providing biocompatibility, printability, and shape retention post-printing. Topological properties in flowing polymers are determined by macromolecule conformation, namely orientation and stretch degree. We utilized the micro-macro approach to describe hydrogel macromolecule orientation during extrusion, offering a two-scale fluid behavior description. The study aimed to use the Fokker-Planck equation to represent constituent population (polymer chain) state within a hydrogel's representative elementary volume during extrusion-based 3D printing. Our findings indicate that a high shear rate drives constituent orientation in tubular nozzle syringe setups, overcoming fluid rheological behavior. Additionally, the interaction coefficient (C_i), representing microscopic fluid particle interaction, surpasses hydrogel behavior for constituent orientation prediction. This approach provides an initial but robust framework to model scaffold anisotropy, enabling optimization of the extrusion process while maintaining computational feasibility.
- [38] arXiv:2310.09758 (replaced) [pdf, other]
-
Title: Genome hybridization: A universal way for the origin and diversification of organelles as well as the origin and speciation of eukaryotesComments: 23 pages with two tables; added references for section 4; revised and added testable predictions for Section 5Subjects: Other Quantitative Biology (q-bio.OT)
The origin of organelles (mitochondrion, chloroplast and nucleus) remains enigmatic. The endosymbiotic hypothesis that chloroplasts, mitochondria and nuclei descend from the endosymbiotic cyanobacterium, bacterium and archaebacterium respectively is dominant yet uncompelling, while our discovery of de novo organelle biogenesis in the cyanobacterium TDX16 that had acquired the genome of its green algal host Haematococcus pluvialis overturns this hypothesis. In light of organelle biogenesis in the cyanobacterium TDX16 in combination with the relevant cellular and molecular evidence, we propose genome hybridization hypothesis (GHH) that the origin of organelles and origin of eukaryotes as well as the diversification of organelles and speciation of eukaryotes are unified and achieved by genome hybridization: the endosymbiotic cyanobacteria or bacteria obtain genomes of their archaebacterial or eukaryotic hosts and hybridize with their own ones resulting in expanded genomes containing a mixture of hybrid prokaryotic genes and eukaryotic genes, and thus the cyanobacteria or bacteria have to compartmentalize to accommodate different genes for specialized function of photosynthesis (chloroplast), respiration (mitochondrion) and DNA preservation (nucleus), and consequently turn into photosynthetic or heterotrophic eukaryotes. Accordingly, eukaryotes and their organelles are of multiple origin, while the formation of cancer cells is the speciation of eukaryotes as cancer cells are new species of unicellular eukaryotes arising from bacteria. Therefore, GHH provides a theoretical framework unifying evolutionary biology, cancer biology and cell biology and directing the integrated multidisciplinary research.
- [39] arXiv:2401.03376 (replaced) [pdf, other]
-
Title: How to optimize neuroscience data utilization and experiment design for advancing brain models of visual and linguistic cognition?Greta Tuckute, Dawn Finzi, Eshed Margalit, Joel Zylberberg, SueYeon Chung, Alona Fyshe, Evelina Fedorenko, Nikolaus Kriegeskorte, Jacob Yates, Kalanit Grill-Spector, Kohitij KarSubjects: Neurons and Cognition (q-bio.NC)
In recent years, neuroscience has made significant progress in building large-scale artificial neural network (ANN) models of brain activity and behavior. However, there is no consensus on the most efficient ways to collect data and design experiments to develop the next generation of models. This article explores the controversial opinions that have emerged on this topic in the domain of vision and language. Specifically, we address two critical points. First, we weigh the pros and cons of using qualitative insights from empirical results versus raw experimental data to train models. Second, we consider model-free (intuition-based) versus model-based approaches for data collection, specifically experimental design and stimulus selection, for optimal model development. Finally, we consider the challenges of developing a synergistic approach to experimental design and model building, including encouraging data and model sharing and the implications of iterative additions to existing models. The goal of the paper is to discuss decision points and propose directions for both experimenters and model developers in the quest to understand the brain.
- [40] arXiv:2402.05961 (replaced) [pdf, html, other]
-
Title: Genetic-guided GFlowNets for Sample Efficient Molecular OptimizationComments: NeurIPS 2024Subjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
The challenge of discovering new molecules with desired properties is crucial in domains like drug discovery and material design. Recent advances in deep learning-based generative methods have shown promise but face the issue of sample efficiency due to the computational expense of evaluating the reward function. This paper proposes a novel algorithm for sample-efficient molecular optimization by distilling a powerful genetic algorithm into deep generative policy using GFlowNets training, the off-policy method for amortized inference. This approach enables the deep generative policy to learn from domain knowledge, which has been explicitly integrated into the genetic algorithm. Our method achieves state-of-the-art performance in the official molecular optimization benchmark, significantly outperforming previous methods. It also demonstrates effectiveness in designing inhibitors against SARS-CoV-2 with substantially fewer reward calls.
- [41] arXiv:2407.07059 (replaced) [pdf, html, other]
-
Title: Differentiable Optimization of Similarity Scores Between Models and BrainsNathan Cloos, Moufan Li, Markus Siegel, Scott L. Brincat, Earl K. Miller, Guangyu Robert Yang, Christopher J. CuevaComments: 20 pages, 12 figuresSubjects: Neurons and Cognition (q-bio.NC); Machine Learning (cs.LG)
How do we know if two systems - biological or artificial - process information in a similar way? Similarity measures such as linear regression, Centered Kernel Alignment (CKA), Normalized Bures Similarity (NBS), and angular Procrustes distance, are often used to quantify this similarity. However, it is currently unclear what drives high similarity scores and even what constitutes a "good" score. Here, we introduce a novel tool to investigate these questions by differentiating through similarity measures to directly maximize the score. Surprisingly, we find that high similarity scores do not guarantee encoding task-relevant information in a manner consistent with neural data; and this is particularly acute for CKA and even some variations of cross-validated and regularized linear regression. We find no consistent threshold for a good similarity score - it depends on both the measure and the dataset. In addition, synthetic datasets optimized to maximize similarity scores initially learn the highest variance principal component of the target dataset, but some methods like angular Procrustes capture lower variance dimensions much earlier than methods like CKA. To shed light on this, we mathematically derive the sensitivity of CKA, angular Procrustes, and NBS to the variance of principal component dimensions, and explain the emphasis CKA places on high variance components. Finally, by jointly optimizing multiple similarity measures, we characterize their allowable ranges and reveal that some similarity measures are more constraining than others. While current measures offer a seemingly straightforward way to quantify the similarity between neural systems, our work underscores the need for careful interpretation. We hope the tools we developed will be used by practitioners to better understand current and future similarity measures.
- [42] arXiv:2410.13466 (replaced) [pdf, html, other]
-
Title: Comparing Methodological Variations in Seizure Onset Localisation Algorithms using intracranial EEGSarah J. Gascoigne, Manel Vila-Vidal, Nathan Evans, Christopher Thornton, Heather Woodhouse, Billy Smith, Anderson Brito Da Silva, Rhys H. Thomas, Kevin Wilson, Peter N. Taylor, Adria Tauste Campo, Yujiang WangSubjects: Neurons and Cognition (q-bio.NC)
During clinical treatment for epilepsy, the area of the brain thought to be responsible for pathological activity is identified. This identification is typically performed through visual assessment of EEG recordings; however, this is time consuming and prone to subjective inconsistency. Automated onset localisation algorithms provide objective identification of the onset location by highlighting changes in signal features associated with seizure onset. In this work we investigate how methodological differences in such algorithms can result in different onset locations being identified.
We analysed ictal intracranial EEG (icEEG) recordings in 16 subjects (100 seizures) with drug-resistant epilepsy from the SWEZ-ETHZ public database. We identified a series of key methodological differences that must be considered when designing or selecting an onset localisation algorithm. These differences were demonstrated using three distinct algorithms that capture different, but complementary, seizure onset features: Imprint, Epileptogenicity Index, and Low Entropy Map. We assessed methodological differences (or Decision Points), and their impact on the identified onset locations.
Our independent application of all three algorithms to the same ictal icEEG dataset revealed low agreement between them: 27-60% of onset channels showed minimal or no overlap. Therefore, we investigated the effect of three key differences: (i) how to define a baseline, (ii) whether low-frequency components are considered, and finally (iii) whether electrodecrement is considered. Changes at each Decision Point were found to substantially influence resultant onset channels (r>0.3).
Our results demonstrate how seemingly small methodological changes can result in large differences in onset locations. We propose that key Decision Points must be considered when using or designing an onset localisation algorithm. - [43] arXiv:2410.14453 (replaced) [pdf, other]
-
Title: How EEG preprocessing shapes decoding performanceSubjects: Neurons and Cognition (q-bio.NC)
EEG preprocessing varies widely between studies, but its impact on classification performance remains poorly understood. To address this gap, we analyzed seven experiments with 40 participants drawn from the public ERP CORE dataset. We systematically varied key preprocessing steps, such as filtering, referencing, baseline interval, detrending, and multiple artifact correction steps. Then we performed trial-wise binary classification (i.e., decoding) using neural networks (EEGNet), or time-resolved logistic regressions. Our findings demonstrate that preprocessing choices influenced decoding performance considerably. All artifact correction steps reduced decoding performance across all experiments and models, while higher high-pass filter cutoffs consistently enhanced decoding. For EEGNet, baseline correction further improved performance, and for time-resolved classifiers, linear detrending and lower low-pass filter cutoffs were beneficial. Other optimal preprocessing choices were specific for each experiment. The current results underline the importance of carefully selecting preprocessing steps for EEG-based decoding. If not corrected, artifacts facilitate decoding but compromise conclusive interpretation.
- [44] arXiv:2412.17780 (replaced) [pdf, html, other]
-
Title: PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete DiffusionSubjects: Biomolecules (q-bio.BM); Artificial Intelligence (cs.AI)
Peptide therapeutics, a major class of medicines, have achieved remarkable success across diseases such as diabetes and cancer, with landmark examples such as GLP-1 receptor agonists revolutionizing the treatment of type-2 diabetes and obesity. Despite their success, designing peptides that satisfy multiple conflicting objectives, such as target binding affinity, solubility, and membrane permeability, remains a major challenge. Classical drug development and structure-based design are ineffective for such tasks, as they fail to optimize global functional properties critical for therapeutic efficacy. Existing generative frameworks are largely limited to continuous spaces, unconditioned outputs, or single-objective guidance, making them unsuitable for discrete sequence optimization across multiple properties. To address this, we present PepTune, a multi-objective discrete diffusion model for the simultaneous generation and optimization of therapeutic peptide SMILES. Built on the Masked Discrete Language Model (MDLM) framework, PepTune ensures valid peptide structures with state-dependent masking schedules and penalty-based objectives. To guide the diffusion process, we propose a Monte Carlo Tree Search (MCTS)-based strategy that balances exploration and exploitation to iteratively refine Pareto-optimal sequences. MCTS integrates classifier-based rewards with search-tree expansion, overcoming gradient estimation challenges and data sparsity inherent to discrete spaces. Using PepTune, we generate diverse, chemically-modified peptides optimized for multiple therapeutic properties, including target binding affinity, membrane permeability, solubility, hemolysis, and non-fouling characteristics on various disease-relevant targets. In total, our results demonstrate that MCTS-guided discrete diffusion is a powerful and modular approach for multi-objective sequence design in discrete state spaces.
- [45] arXiv:2307.13918 (replaced) [pdf, html, other]
-
Title: Simulation-based Inference for Cardiovascular ModelsAntoine Wehenkel, Laura Manduchi, Jens Behrmann, Luca Pegolotti, Andrew C. Miller, Guillermo Sapiro, Ozan Sener, Marco Cuturi, Jörn-Henrik JacobsenSubjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
Over the past decades, hemodynamics simulators have steadily evolved and have become tools of choice for studying cardiovascular systems in-silico. While such tools are routinely used to simulate whole-body hemodynamics from physiological parameters, solving the corresponding inverse problem of mapping waveforms back to plausible physiological parameters remains both promising and challenging. Motivated by advances in simulation-based inference (SBI), we cast this inverse problem as statistical inference. In contrast to alternative approaches, SBI provides \textit{posterior distributions} for the parameters of interest, providing a \textit{multi-dimensional} representation of uncertainty for \textit{individual} measurements. We showcase this ability by performing an in-silico uncertainty analysis of five biomarkers of clinical interest comparing several measurement modalities. Beyond the corroboration of known facts, such as the feasibility of estimating heart rate, our study highlights the potential of estimating new biomarkers from standard-of-care measurements. SBI reveals practically relevant findings that cannot be captured by standard sensitivity analyses, such as the existence of sub-populations for which parameter estimation exhibits distinct uncertainty regimes. Finally, we study the gap between in-vivo and in-silico with the MIMIC-III waveform database and critically discuss how cardiovascular simulations can inform real-world data analysis.
- [46] arXiv:2402.15480 (replaced) [pdf, html, other]
-
Title: Foveated Retinotopy Improves Classification and Localization in CNNsSubjects: Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
From a falcon detecting prey to humans recognizing faces, many species exhibit extraordinary abilities in rapid visual localization and classification. These are made possible by a specialized retinal region called the fovea, which provides high acuity at the center of vision while maintaining lower resolution in the periphery. This distinctive spatial organization, preserved along the early visual pathway through retinotopic mapping, is fundamental to biological vision, yet remains largely unexplored in machine learning. Our study investigates how incorporating foveated retinotopy may benefit deep convolutional neural networks (CNNs) in image classification tasks. By implementing a foveated retinotopic transformation in the input layer of standard ResNet models and re-training them, we maintain comparable classification accuracy while enhancing the network's robustness to scale and rotational perturbations. Although this architectural modification introduces increased sensitivity to fixation point shifts, we demonstrate how this apparent limitation becomes advantageous: variations in classification probabilities across different gaze positions serve as effective indicators for object localization. Our findings suggest that foveated retinotopic mapping encodes implicit knowledge about visual object geometry, offering an efficient solution to the visual search problem - a capability crucial for many living species.
- [47] arXiv:2409.00035 (replaced) [pdf, other]
-
Title: EEG Right & Left Voluntary Hand Movement-based Virtual Brain-Computer Interfacing Keyboard Using Hybrid Deep Learning ApproachSubjects: Signal Processing (eess.SP); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Neurons and Cognition (q-bio.NC)
Brain-machine interfaces (BMIs), particularly those based on electroencephalography (EEG), offer promising solutions for assisting individuals with motor disabilities. However, challenges in reliably interpreting EEG signals for specific tasks, such as simulating keystrokes, persist due to the complexity and variability of brain activity. Current EEG-based BMIs face limitations in adaptability, usability, and robustness, especially in applications like virtual keyboards, as traditional machine-learning models struggle to handle high-dimensional EEG data effectively. To address these gaps, we developed an EEG-based BMI system capable of accurately identifying voluntary keystrokes, specifically leveraging right and left voluntary hand movements. Using a publicly available EEG dataset, the signals were pre-processed with band-pass filtering, segmented into 22-electrode arrays, and refined into event-related potential (ERP) windows, resulting in a 19x200 feature array categorized into three classes: resting state (0), 'd' key press (1), and 'l' key press (2). Our approach employs a hybrid neural network architecture with BiGRU-Attention as the proposed model for interpreting EEG signals, achieving superior test accuracy of 90% and a mean accuracy of 91% in 10-fold stratified cross-validation. This performance outperforms traditional ML methods like Support Vector Machines (SVMs) and Naive Bayes, as well as advanced architectures such as Transformers, CNN-Transformer hybrids, and EEGNet. Finally, the BiGRU-Attention model is integrated into a real-time graphical user interface (GUI) to simulate and predict keystrokes from brain activity. Our work demonstrates how deep learning can advance EEG-based BMI systems by addressing the challenges of signal interpretation and classification.
- [48] arXiv:2409.12846 (replaced) [pdf, html, other]
-
Title: How the (Tensor-) Brain uses Embeddings and Embodiment to Encode Senses and SymbolsSubjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Neurons and Cognition (q-bio.NC)
The Tensor Brain (TB) has been introduced as a computational model for perception and memory. This paper provides an overview of the TB model, incorporating recent developments and insights into its functionality. The TB is composed of two primary layers: the representation layer and the index layer. The representation layer serves as a model for the subsymbolic global workspace, a concept derived from consciousness research. Its state represents the cognitive brain state, capturing the dynamic interplay of sensory and cognitive processes. The index layer, in contrast, contains symbolic representations for concepts, time instances, and predicates. In a bottom-up operation, sensory input activates the representation layer, which then triggers associated symbolic labels in the index layer. Conversely, in a top-down operation, symbols in the index layer activate the representation layer, which in turn influences earlier processing layers through embodiment. This top-down mechanism underpins semantic memory, enabling the integration of abstract knowledge into perceptual and cognitive processes. A key feature of the TB is its use of concept embeddings, which function as connection weights linking the index layer to the representation layer. As a concept's ``DNA,'' these embeddings consolidate knowledge from diverse experiences, sensory modalities, and symbolic representations, providing a unified framework for learning and memory.
- [49] arXiv:2410.13881 (replaced) [pdf, other]
-
Title: "Efficient Complexity": a Constrained Optimization Approach to the Evolution of Natural IntelligenceComments: 25 pages, 2 figuresSubjects: Neural and Evolutionary Computing (cs.NE); Neurons and Cognition (q-bio.NC)
A fundamental question in the conjunction of information theory, biophysics, bioinformatics and thermodynamics relates to the principles and processes that guide the development of natural intelligence in natural environments where information about external stimuli may not be available at prior. A novel approach in the description of the information processes of natural learning is proposed in the framework of constrained optimization, where the objective function represented by the information entropy of the internal states of the system with the states of the external environment is maximized under the natural constraints of memory, computing power, energy and other essential resources. The progress of natural intelligence can be interpreted in this framework as a strategy of approximation of the solutions of the optimization problem via a traversal over the extrema network of the objective function under the natural constraints that were examined and described. Non-trivial conclusions on the relationships between the complexity, variability and efficiency of the structure, or architecture of learning models made on the basis of the proposed formalism can explain the effectiveness of neural networks as collaborative groups of small intelligent units in biological and artificial intelligence.
- [50] arXiv:2410.23754 (replaced) [pdf, html, other]
-
Title: RealMind: Advancing Visual Decoding and Language Interaction via EEG SignalsSubjects: Human-Computer Interaction (cs.HC); Neurons and Cognition (q-bio.NC)
Decoding visual stimuli from neural recordings is a critical challenge in the development of brain-computer interfaces (BCIs). Although recent EEG-based decoding approaches have made progress in tasks such as visual classification, retrieval, and reconstruction, they remain constrained by unstable representation learning and a lack of interpretability. This gap highlights the need for more efficient representation learning and the integration of effective language interaction to enhance both understanding and practical usability in visual decoding this http URL address this limitation, we introduce RealMind, a novel EEG-based framework designed to handle a diverse range of downstream tasks. Specifically, RealMind leverages both semantic and geometric consistency learning to enhance feature representation and improve alignment across tasks. Notably, beyond excelling in traditional tasks, our framework marks the first attempt at visual captioning from EEG data through vision-language model (VLM). It achieves a Top-1 decoding accuracy of 27.58% in a 200-class zero-shot retrieval task and a BLEU-1 score of 26.59% in a 200-class zero-shot captioning task. Overall, RealMind provides a comprehensive multitask EEG decoding framework, establishing a foundational approach for EEG-based visual decoding in real-world applications.