Gravitational Wave Mixture Separation for Future Gravitational Wave Observatories Utilizing Deep Learning (2024)

CunLiang Ma, School of Information Engineering, Jiangxi University of Science and Technology, Ganzhou 341000, China
WeiGuang Zhou, School of Information Engineering, Jiangxi University of Science and Technology, Ganzhou 341000, China
Zhoujian Cao (corresponding author: zjcao@amt.ac.cn), Institute of Applied Mathematics, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China; School of Fundamental Physics and Mathematical Sciences, Hangzhou Institute for Advanced Study, UCAS, Hangzhou 310024, China

Abstract

Future GW observatories, such as the Einstein Telescope (ET), are expected to detect gravitational-wave signals, some of which are likely to overlap with each other. Such overlap may lead to the misidentification of two signals as a single GW event, potentially biasing the estimated parameters of the mixture GWs. In this paper, we adapt the concept of speech separation to address this issue by applying it to the signal separation of overlapping GWs. We show that deep learning models can effectively separate overlapping GW signals. The proposed method may aid in eliminating biases in parameter estimation for such signals.

I Introduction

The field of gravitational-wave (GW) detection has witnessed remarkable progress since the first direct detection [1, 2, 3, 4, 5, 6, 7]. The third observing run (O3) of GW detection ended in spring 2020, boosting the total number of confident events to above 90, corresponding to an event rate of roughly 1.5 per week [8, 9, 10, 11]. The upcoming third-generation (3G) detectors, such as the Einstein Telescope (ET) [12, 13] and Cosmic Explorer (CE) [14, 15], envisioned for the 2030s, promise a significant leap forward, enabling detection rates above $10^{5}$ per year at cosmological distances. This surge in detection rate, along with the remarkable enhancement of sensitivity at both lower and higher frequencies in 3G detectors, will significantly extend the duration of signals within the sensitivity band. As a consequence, the probability of GW signals overlapping in these 3G detectors will become significant [16], posing potential challenges for GW searches and parameter estimation.

As early as 2009, T. Regimbau and Scott A. Hughes delved into the effects of binary inspiral confusion on the sensitivity of ground-based GW detectors [16]. They emphasized the necessity for rigorous data analysis to disentangle mixture signals. Since then, numerous studies have focused on analyzing the strain with mixture signals. Y. Himemoto et al., utilizing Fisher matrix analysis, explored the statistical ramifications of mixture GWs on parameter estimation [17]. Their findings revealed that mixture signals can introduce notable statistical errors or systematic biases, especially when the coalescence times and redshifted chirp masses of the mixture GWs are closely matched. A realistic distribution analysis further indicated that mergers occurring within a second of each other are common occurrences over a year in 3G detectors [18].

Modern data analysis techniques for parameter estimation typically assume the presence of a single signal amidst background noise. However, when two or more GWs are detected simultaneously, their signals overlap, creating a distorted, non-physical waveform. This leads the sampling software to identify parameter sets aligned with this composite waveform rather than with the individual signals [19]. Experimental results from P. Relton et al. demonstrated that, in most instances, current parameter estimation methods can accurately assess the parameters of one of the mixture events [19]. Notably, if one signal is at least three times stronger than the other, the louder signal's source parameters remain unaffected [19]. By applying a narrow prior on the coalescence time, obtained during the GW detection phase, it may be feasible to accurately recover both posterior parameter distributions [20]. Experiments conducted by E. Pizzati et al. showed that parameter inference remains robust as long as the coalescence time difference in the detector frame exceeds 1 second [20]. Conversely, when this time difference is less than 0.5 seconds, significant biases in parameter inference are likely to emerge [20]. Comparing the effects of mixture signals on coefficients at various post-Newtonian (PN) orders shows that, overall, the 1PN coefficient experiences the greatest impact. The findings further indicate that, although a significant proportion of mixture signals introduce biases in PN coefficients, which individually might suggest deviations from General Relativity (GR), these deviations collectively occur in random directions; as a result, a statistical aggregation of these effects would still tend to align with GR [21]. Quantifying source confusion within a realistic neutron star binary population reveals that parameter uncertainty generally rises by less than 1%, except in cases where overlapping signals have a detector-frame chirp mass difference of $\lesssim 0.01\,M_{\odot}$ and an overlap frequency of $\gtrsim 40\,\mathrm{Hz}$ [22]. Among $1\times10^{6}$ simulated signals, only 0.14% fall within this range of detector-frame chirp mass differences, and their overlap frequencies are usually below 40 Hz [22].

Apart from parameter estimation, several studies explore the impact of overlapping signals on gravitational wave detection. Within the coherent WaveBurst (cWB) framework for GW searches, most signals resulting from closely merged events are detected only as a single trigger [23]. In the context of the PyCBC framework and the search for binary black hole (BBH) events, it has been noted that when the relative merger time exceeds 1 second, the search efficiency diminishes by approximately 1% [23]. When the relative merger time is less than 1 second, the search efficiency drops by 26%, because most paired signals are either detected by a single trigger or not detected at all [23]. Biases in the estimation of the power spectral density (PSD) will negatively impact the sensitivity of 3G ground-based GW detectors, especially considering the large population of overlapping signals [24]. The confusion noise's contribution to the signal-to-noise ratio (SNR) is considerably smaller than that of the instrumental noise [24].

Certain studies focus on refining data processing techniques to address the challenge posed by overlapping signals. J. Janquart et al. analyze overlapping binary black hole mergers with hierarchical subtraction and joint parameter estimation [25]. They find that joint parameter estimation is usually more precise but comes with higher computational costs. J. Langendorff et al. first utilized normalizing flows for the parameter estimation of overlapping GW signals [26]. Compared to the traditional Bayesian method, the normalizing flow yields broader posterior distributions, whereas the Bayesian-based approach tends to become overconfident, potentially overlooking the injection [26].

Recently, we proposed a novel framework (MSNRnet) aimed at accelerating the matched filtering process for GW detection [27], achieved by incorporating deep learning techniques for waveform extraction and discrimination. However, because the waveform extraction stage captures only one waveform, in scenarios where multiple signals overlap, the MSNRnet framework may overlook one of the overlapping signals.

Real-world speech communication frequently takes place in vibrant, multi-speaker settings [28]. To function effectively in these environments, a speech processing system must be able to distinguish and separate speech from different speakers. While this comes naturally to humans, it has been exceedingly challenging to replicate in machines. In recent years, however, deep learning strategies have notably pushed the boundaries of this problem [29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40], surpassing traditional techniques such as independent component analysis (ICA) [41] and semi-nonnegative matrix factorization (semi-NMF) [42]. Mixed speech is analogous to mixed GW signals. Drawing inspiration from the task of speech separation, this study marks the first attempt to apply deep learning to GW separation. The proposed method for GW signal separation holds potential for future applications in GW searches and parameter estimation. Furthermore, this work complements existing applications of deep learning to GW data processing, including end-to-end GW signal searches [43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61], parameter estimation [62, 63, 64, 65, 66], waveform or envelope extraction [67, 68, 69, 70], GW source localization [71, 72, 73, 74], and glitch classification [75, 76, 77, 78]. Since the GW components are buried in noise, the GW separation task is more challenging than speech separation.

In this work, we explore, for the first time, the potential of deep learning for GW separation. We find that a mixture strain containing noise and multiple signals can be separated.

II METHOD FOR GW SEPARATION

In the early stages of applying deep learning to speech separation, the preprocessing phase typically involved converting the mixed sound into a time-frequency representation [79, 80, 81, 82], isolating source bins via time-frequency masks, and synthesizing waveforms via the inverse time-frequency transform. However, challenges arose, including questions about the optimality of the Fourier decomposition and the need to handle both magnitude and phase in the complex STFT domain. This often led to methods that adjusted only the magnitude, ultimately limiting separation performance. In 2018, Luo et al. introduced the Time-domain Audio Separation Network (TasNet) [28], a neural network designed to directly model the time-domain mixture waveform through an encoder-separation-decoder framework, with the actual separation occurring at the encoder's output. The following year, they refined TasNet into Conv-TasNet [29], whose key innovation was the use of a Temporal Convolutional Network (TCN), consisting of stacked one-dimensional dilated convolutional blocks, for the separation component. In 2020, the same team proposed DPRNN [32], which incorporated a dual-path RNN for the separation phase. Later that year, J. Chen et al. enhanced DPRNN into DPTNet [34], replacing the dual-path RNN module with a dual-path transformer module. We have applied all three iterations of TasNet (Conv-TasNet, DPRNN, and DPTNet) to the task of GW separation and find that DPRNN is superior to the other two. Therefore, in this work we focus on DPRNN for GW separation.

Suppose that the strain captured by the interferometer, denoted as $d(t)$, can be regarded as a combination of a noise component $n(t)$ and the GW component $h(t)$:

$$d(t) = n(t) + h(t). \qquad (1)$$

For the GW separation task, the GW component comprises multiple signals, represented as $h(t)=\sum_{i=1}^{N} h_{i}(t)$, where $h_{i}(t)$ denotes each individual GW signal and $N$ denotes the overall number of GW signals in the analyzed data segment. In this work, we focus solely on the scenario where two signals, $h_{A}(t)$ and $h_{B}(t)$, are present in the data, so

$$d(t) = n(t) + h_{A}(t) + h_{B}(t). \qquad (2)$$

We aim to directly estimate $h_{A}(t)$ and $h_{B}(t)$ from $d(t)$. The TasNet-like framework decomposes the signal separation task into three stages: Encoder, Separation, and Decoder; the overall framework for GW separation is shown in Fig. 1. During the Encoder stage, the input signal is encoded into a hidden-layer feature $F$. In the Separation stage, masks ($M_{A}$ and $M_{B}$) for each signal component are estimated. Subsequently, the Decoder stage uses these masked features to obtain the separated outputs as follows:

$$\widetilde{h}_{A} = \mathrm{Decoder}\left(M_{A}\odot F\right), \qquad (3)$$
$$\widetilde{h}_{B} = \mathrm{Decoder}\left(M_{B}\odot F\right), \qquad (4)$$

where $\odot$ denotes the Hadamard product. The Encoder, Separation, and Decoder stages can be likened, respectively, to the STFT, time-frequency masking, and inverse STFT stages of signal separation based on the short-time Fourier transform. In the subsections that follow, we elaborate on the three stages of GW separation.
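Before detailing each stage, the following minimal PyTorch sketch (our own illustrative skeleton, not the released implementation) shows how the estimated masks are applied to the encoded feature and decoded back to waveforms, following Eqs. (3) and (4); the concrete layers of each stage are sketched in the subsections below.

```python
# Illustrative skeleton of the Encoder-Separation-Decoder pipeline (Eqs. 3-4).
import torch
import torch.nn as nn

class GWSeparator(nn.Module):
    def __init__(self, encoder, separator, decoder):
        super().__init__()
        self.encoder = encoder      # strain d      -> feature F, shape [batch, C, L]
        self.separator = separator  # feature F     -> masks, shape [batch, 2, C, L]
        self.decoder = decoder      # masked feature -> waveform, shape [batch, 1, L]

    def forward(self, d):           # d: mixture strain, shape [batch, 1, L]
        F = self.encoder(d)
        masks = self.separator(F)                 # M_A = masks[:, 0], M_B = masks[:, 1]
        h_A = self.decoder(masks[:, 0] * F)       # Eq. (3): Hadamard product, then decode
        h_B = self.decoder(masks[:, 1] * F)       # Eq. (4)
        return h_A, h_B
```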

[Figure 1: Overall framework of the proposed GW separation network.]

II.1 Encoder stage

Suppose the Encoder receives an input signal $s \in \mathbb{R}^{1\times L}$, where $L$ denotes the number of time samples of the input strain. Through the Encoder stage, we obtain the signal feature $F \in \mathbb{R}^{C\times L}$ by

$$F = \mathrm{ReLU}\left(\mathrm{Conv1D}\left(s\right)\right), \qquad (5)$$

where the 1D convolutional layer uses $C=256$ filters with a filter size of 2.
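As a concrete illustration, the Encoder reduces to a single 1D convolution followed by a ReLU; the sketch below is our own rendering of this stage, with the exact handling of boundary samples left unspecified in the text.

```python
# Sketch of the Encoder stage: a 1D convolution (C=256 filters of size 2) plus ReLU.
import torch.nn as nn

class Encoder(nn.Module):
    def __init__(self, C=256):
        super().__init__()
        # 256 filters of length 2, stride 1, no padding; the output length is L-1,
        # which the paper treats as (approximately) L.
        self.conv = nn.Conv1d(in_channels=1, out_channels=C, kernel_size=2, stride=1)
        self.relu = nn.ReLU()

    def forward(self, s):               # s: [batch, 1, L]
        return self.relu(self.conv(s))  # F: [batch, C, L-1]
```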

II.2 Separation stage

The input of the Separation stage is the signal feature $F$, and its output is two feature masks, $M_{A}$ and $M_{B}$. The signal feature $F$ is first passed through layer normalization and a Conv1D layer, transforming it into a tensor of shape $\mathbb{R}^{N\times L}$, where $N=64$ is the number of Conv1D filters. Afterward, the tensor sequentially undergoes a segmentation operation, processing through four DPRNN blocks, and an overlap-add operation. In the segmentation step, the 2D tensor is transformed into a 3D tensor by splitting it into overlapping sub-frames. This transformed tensor is then relayed to a stack of DPRNN blocks, in which local and global modeling are applied alternately and interactively. Upon completion of the DPRNN processing, the output of the final block is passed to a 2D convolutional layer and subsequently converted back to two 2D tensors via the overlap-add operation. These tensors are then processed in parallel by two distinct convolutional modules with different activation functions, Tanh and Sigmoid. The resulting tensors are combined and passed through a ReLU activation function, ultimately yielding the two masks $M_{A}$ and $M_{B}$.

II.2.1 Segmentation and Overlap-Add

Fig. 2 shows the flow chart of the segmentation and overlap-add steps in the Separation stage. Let the input of the segmentation be a 2D tensor $F$ and the output be a 3D tensor $T$. In the segmentation step, we first split the 2D tensor into $S$ small tensors ($D_{i} \in \mathbb{R}^{N\times K}$, $i \in \{1,2,\ldots,S\}$) and then concatenate all the small 2D tensors to form a 3D tensor $T=[D_{1},D_{2},\ldots,D_{S}] \in \mathbb{R}^{N\times K\times S}$. In this work $K=250$ and $S=134$.

[Figure 2: Flow chart of the segmentation and overlap-add steps in the Separation stage.]

Denote the output of the last DPRNN block as $T_{B+1} \in \mathbb{R}^{N\times K\times S}$; the overlap-add step can then be seen as the inverse of the segmentation step. It combines the $S$ 2D tensors to form the output $Q \in \mathbb{R}^{N\times L}$. We first split the 3D tensor into $S$ 2D tensors and align them according to their positions in time; we then sum the $S$ 2D tensors to obtain a single 2D tensor.
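A minimal sketch of the segmentation and overlap-add operations is given below. The hop between chunks is not stated explicitly in the text; following the original DPRNN design we assume 50% overlap (hop $K/2$), which with $K=250$, zero-padding, and a 4-second strain yields a chunk count of the same order as $S=134$.

```python
# Sketch of segmentation (2D -> 3D) and overlap-add (3D -> 2D), assuming 50% chunk overlap.
import torch
import torch.nn.functional as nnF

def segment(F, K=250):
    """F: [N, L] -> T: [N, K, S], cutting overlapping chunks with hop K // 2."""
    N, L = F.shape
    hop = K // 2
    pad = (hop - (L % hop)) % hop
    F = nnF.pad(F, (hop, hop + pad))            # pad both ends so every sample is covered
    chunks = F.unfold(1, K, hop)                # [N, S, K]
    return chunks.permute(0, 2, 1).contiguous() # [N, K, S]

def overlap_add(T, L, K=250):
    """T: [N, K, S] -> Q: [N, L], summing chunks back at their original positions."""
    N, _, S = T.shape
    hop = K // 2
    Q = torch.zeros(N, hop * (S - 1) + K)
    for j in range(S):                          # add chunk j at offset j * hop
        Q[:, j * hop: j * hop + K] += T[:, :, j]
    return Q[:, hop: hop + L]                   # undo the initial front padding
```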

II.2.2 DPRNN block

The segmentation output $T$ is subsequently forwarded to a stack of 4 DPRNN blocks. Each block maps a 3D tensor to another 3D tensor of the same shape. Let us take the map $T_{i} \rightarrow T_{i+1}$ as an example to illustrate the computation performed by a DPRNN block. The flow chart of the DPRNN block is illustrated in Fig. 3. Initially, the input tensor is processed by a local modeling block, followed by a global modeling block. The key distinction between these two blocks lies in how they slice the signal: the local modeling block slices the 3D tensor along the third index, whereas the global modeling block slices along the second index. For brevity, we only detail the mathematical expressions for local modeling.

[Figure 3: Flow chart of the DPRNN block.]

Suppose the input of the local modeling is $T_{i}$ and the output is $\widehat{T}_{i}$. We first feed each chunk to a bidirectional LSTM block and concatenate the outputs to obtain a tensor $U_{i} \in \mathbb{R}^{H\times K\times S}$. In this work, we set $H$ to 256.

$$U_{i} = \underset{j}{\mathrm{Concatenate}}\;\mathrm{BiLSTM}\left(T_{i}[:,:,j]\right), \qquad (6)$$

where $T_{i}[:,:,j] \in \mathbb{R}^{N\times K}$ is the sequence defined by chunk $j$. We then apply a fully connected layer to the tensor $U_{i}$ and obtain $\widehat{U}_{i} \in \mathbb{R}^{N\times K\times S}$ as follows:

$$\widehat{U}_{i} = \underset{j}{\mathrm{Concatenate}}\;G\,U_{i}[:,:,j], \qquad (7)$$

where $G \in \mathbb{R}^{N\times H}$. Layer normalization is then applied to $\widehat{U}_{i}$ as follows:

$$\mathrm{LN}(\widehat{U}_{i}) = \frac{\widehat{U}_{i} - \mu(\widehat{U}_{i})}{\sqrt{\sigma(\widehat{U}_{i}) + \epsilon}} \odot z + r, \qquad (8)$$

where $z, r \in \mathbb{R}^{N\times 1}$ are the rescaling factors, $\epsilon$ is a small positive number for numerical stability, and $\mu(\cdot)$ and $\sigma(\cdot)$ represent the mean and standard deviation operators, respectively. We then obtain $\widehat{T}_{i}$ as follows:

$$\widehat{T}_{i} = T_{i} + \mathrm{LN}(\widehat{U}_{i}). \qquad (9)$$

Feeding the 3D tensor $\widehat{T}_{i}$ to the global modeling block, we then obtain the output of the DPRNN block, $T_{i+1}$.
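The local and global modeling described above (Eqs. (6)-(9), with the same pattern repeated along the inter-chunk axis) can be sketched as follows. This is our own simplified rendering; we assume the same BiLSTM-linear-LayerNorm-residual structure for both paths, with the bidirectional hidden size chosen so the concatenated output width equals $H=256$.

```python
# Sketch of one DPRNN block: intra-chunk (local) then inter-chunk (global) modeling,
# each following the BiLSTM -> linear -> LayerNorm -> residual pattern of Eqs. (6)-(9).
import torch.nn as nn

class DPRNNBlock(nn.Module):
    def __init__(self, N=64, H=256):
        super().__init__()
        self.local_rnn = nn.LSTM(N, H // 2, batch_first=True, bidirectional=True)
        self.local_fc = nn.Linear(H, N)          # plays the role of the matrix G in Eq. (7)
        self.local_norm = nn.LayerNorm(N)
        self.global_rnn = nn.LSTM(N, H // 2, batch_first=True, bidirectional=True)
        self.global_fc = nn.Linear(H, N)
        self.global_norm = nn.LayerNorm(N)

    def _path(self, T, rnn, fc, norm):
        # T: [B, N, K, S]; run the BiLSTM along the K axis, treating the S chunks as batch.
        B, N, K, S = T.shape
        x = T.permute(0, 3, 2, 1).reshape(B * S, K, N)
        y, _ = rnn(x)                            # Eq. (6): BiLSTM over each sequence
        y = norm(fc(y))                          # Eqs. (7)-(8): linear map and layer norm
        y = y.reshape(B, S, K, N).permute(0, 3, 2, 1)
        return T + y                             # Eq. (9): residual connection

    def forward(self, T):                        # T: [B, N, K, S]
        T = self._path(T, self.local_rnn, self.local_fc, self.local_norm)
        T = self._path(T.transpose(2, 3), self.global_rnn, self.global_fc,
                       self.global_norm).transpose(2, 3)   # global path: sequence over S
        return T
```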

II.3 Decoder stage

The Decoder stage maps each masked encoded feature $F_{M_{i}} = M_{i} \odot F \in \mathbb{R}^{C\times L}$ to a separated signal. Each element in $F_{M_{i}}$ (which can be likened to a feature value) may be viewed as a component of a hidden vector (comparable to a feature vector) at a specific time.

$$\widetilde{h}_{i} = \mathrm{ConvTranspose1d}\left(F_{M_{i}}\right), \qquad (10)$$

The transposed convolutional layer maps these hidden vectors back to the time domain through its adjustable parameters. This layer accepts $C$ input channels and outputs a single channel; its purpose is to decrease the channel count of the masked encoded features from $C$ to 1. By configuring the kernel size as 2, the stride as 1, and the padding as 0, the transposed convolution preserves the length of the time series at $L$. As a result, the masked encoded features are reconstituted into a one-dimensional time series, denoted as $\widetilde{h}_{i} \in \mathbb{R}^{1\times L}$.
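A corresponding sketch of the Decoder (again our illustrative version) is:

```python
# Sketch of the Decoder stage: a transposed 1D convolution mapping the C masked
# feature channels back to a single time series (kernel size 2, stride 1, padding 0).
import torch.nn as nn

class Decoder(nn.Module):
    def __init__(self, C=256):
        super().__init__()
        self.deconv = nn.ConvTranspose1d(in_channels=C, out_channels=1,
                                         kernel_size=2, stride=1, padding=0)

    def forward(self, masked_F):         # masked_F = M_i * F, shape [batch, C, L]
        # With these settings the output length is L+1, which the paper treats as L.
        return self.deconv(masked_F)     # separated waveform, shape [batch, 1, L+1]
```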

III DATA FOR TRAINING AND TESTING

In this paper, we concentrate on the Einstein Telescope, which could potentially consist of three detectors arranged in a triangular configuration. For simplicity, we limit our analysis to just one of these detectors. We utilize the PyCBC package [83, 84, 85, 86] to synthesize data for training, validation, and testing. The strain captured by the detector can be represented as a combination of noise and two mixture signals, $n(t) + h_{A}(t) + h_{B}(t)$, where $n(t)$ denotes the noise component. This noise is generated using the power spectral density (PSD) associated with the Einstein Telescope, which characterizes the detector's sensitivity at various frequencies. Specifically, we use the EinsteinTelescopeP1600143 PSD to simulate this noise.

Both $h_{A}(t)$ and $h_{B}(t)$ are generated through a linear combination of $h_{+}(t)$ and $h_{\times}(t)$, which are modeled with SEOBNRv4. In our waveform simulation, the masses of the two black holes are drawn from the interval $(10\,M_{\odot}, 80\,M_{\odot})$. The dimensionless spin is randomly sampled within the interval $(0, 0.998)$. Additionally, the declination and right ascension are uniformly sampled across the entire sphere. During the simulation of $h_{+}(t)$ and $h_{\times}(t)$, the luminosity distance from the astrophysical source to Earth is fixed at 4000 Mpc.

In the training phase, the amplitudes of $h_{A}(t)$ and $h_{B}(t)$ are randomly rescaled to match two randomly generated signal-to-noise ratios (SNRs) between 5 and 20. Furthermore, the peak amplitude times of $h_{A}(t)$ and $h_{B}(t)$ are randomly positioned between 50% and 95% of the designated time window, which spans 4 seconds. The entire simulation operates at a sampling frequency of 4096 Hz.
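To illustrate the data synthesis, the sketch below generates one such sample with PyCBC. It is a simplified stand-in for our pipeline: the low-frequency cutoff f_low and the antenna-pattern coefficients used to combine the two polarizations are assumptions made only for this example.

```python
# Minimal, illustrative sketch of synthesizing one training sample with PyCBC:
# ET-like Gaussian noise plus two rescaled SEOBNRv4 signals (d = n + h_A + h_B).
import numpy as np
from pycbc.waveform import get_td_waveform
from pycbc.psd import EinsteinTelescopeP1600143
from pycbc.noise import noise_from_psd
from pycbc.filter import sigma
from pycbc.types import TimeSeries

fs, duration, f_low = 4096, 4.0, 10.0                 # sampling rate, window length, assumed cutoff
delta_t = 1.0 / fs
n_samples = int(fs * duration)
psd = EinsteinTelescopeP1600143(n_samples // 2 + 1, 1.0 / duration, f_low)
noise = noise_from_psd(n_samples, delta_t, psd, seed=0)   # n(t)

def make_signal(snr_target, peak_frac, rng):
    """Simulate one BBH signal, place its peak at peak_frac of the window, rescale to snr_target."""
    m1, m2 = rng.uniform(10, 80, size=2)
    s1z, s2z = rng.uniform(0, 0.998, size=2)
    hp, hc = get_td_waveform(approximant="SEOBNRv4", mass1=m1, mass2=m2,
                             spin1z=s1z, spin2z=s2z, distance=4000.0,
                             delta_t=delta_t, f_lower=f_low)
    fp, fc = rng.uniform(-1, 1), rng.uniform(-1, 1)   # stand-in antenna factors (illustrative only)
    h = (fp * hp + fc * hc).numpy()
    out = np.zeros(n_samples)
    peak_idx = int(peak_frac * n_samples)             # desired peak position in the 4 s window
    start = peak_idx - int(np.argmax(np.abs(h)))
    seg = h[max(0, -start): n_samples - start]
    out[max(0, start): max(0, start) + len(seg)] = seg
    ts = TimeSeries(out, delta_t=delta_t)
    # Rescale so the matched-filter SNR against the ET PSD equals snr_target.
    return out * (snr_target / sigma(ts, psd=psd, low_frequency_cutoff=f_low))

rng = np.random.default_rng(0)
h_A = make_signal(rng.uniform(5, 20), rng.uniform(0.5, 0.95), rng)
h_B = make_signal(rng.uniform(5, 20), rng.uniform(0.5, 0.95), rng)
strain = noise.numpy() + h_A + h_B                    # d(t) = n(t) + h_A(t) + h_B(t)
```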

IV PERFORMANCE OF THE GW SEPARATION NETWORK

Previous studies examining data processing of overlapping gravitational wave (GW) strains have primarily focused on how GW overlapping affects traditional GW data processing methods, such as matched filtering for GW detection [23] and Bayesian posterior sampling for parameter estimation [20]. Recently, the normalizing flow has emerged as a new technique for parameter estimation of overlapping GW strains [26, 84]. In our study, we propose the utilization of signal separation via deep learning for the analysis of overlapping GW strains.

The gravitational wave (GW) separation network can be considered a parameterized system. The network’s output includes the waveforms of the estimated clean gravitational wave signals. To optimize the performance of the proposed model, we train it using utterance-level permutation invariant training (uPIT) [87], aiming to maximize the scale-invariant signal-to-noise ratio (SI-SNR) [28]. SI-SNR is defined as:

$$s_{\mathrm{target}} = \frac{\langle\tilde{h}, h\rangle\, h}{\lVert h\rVert^{2}}, \qquad (11)$$
$$e_{\mathrm{noise}} = \tilde{h} - s_{\mathrm{target}}, \qquad (12)$$
$$\text{SI-SNR} := 10\log_{10}\frac{\lVert s_{\mathrm{target}}\rVert^{2}}{\lVert e_{\mathrm{noise}}\rVert^{2}}, \qquad (13)$$

where $\tilde{h} \in \mathbb{R}^{1\times L}$ and $h \in \mathbb{R}^{1\times L}$ are the estimated and target clean sources respectively, $L$ denotes the length of the signals, and both $\tilde{h}$ and $h$ are normalized to zero mean to ensure scale invariance. During the training phase, the Adam optimizer is used with a learning rate of $10^{-5}$, and the model is trained for 20 epochs. During the training stage, we assume that the peak time of signal A lags behind that of signal B. In other words, typically only the inspiral stage of signal A is disrupted, whereas signal B experiences interference throughout its entire evolution, encompassing the inspiral, merger, and ringdown stages.
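A minimal sketch of this training objective is given below, assuming PyTorch tensors of shape [batch, 2, L] for the two estimated and two injected waveforms; it combines the negative SI-SNR of Eqs. (11)-(13) with permutation-invariant training over the two possible output-to-source assignments.

```python
# Sketch of the negative SI-SNR loss with utterance-level permutation invariant training.
import torch

def si_snr(est, ref, eps=1e-8):
    """Scale-invariant SNR of Eqs. (11)-(13); est and ref have shape [..., L]."""
    est = est - est.mean(dim=-1, keepdim=True)      # zero-mean for scale invariance
    ref = ref - ref.mean(dim=-1, keepdim=True)
    s_target = (est * ref).sum(-1, keepdim=True) * ref / (ref.pow(2).sum(-1, keepdim=True) + eps)
    e_noise = est - s_target
    return 10 * torch.log10(s_target.pow(2).sum(-1) / (e_noise.pow(2).sum(-1) + eps))

def upit_loss(est, ref):
    """Negative SI-SNR, maximized over the two possible source permutations."""
    perm1 = si_snr(est[:, 0], ref[:, 0]) + si_snr(est[:, 1], ref[:, 1])
    perm2 = si_snr(est[:, 0], ref[:, 1]) + si_snr(est[:, 1], ref[:, 0])
    return -0.5 * torch.maximum(perm1, perm2).mean()
```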

In this section, we explore the performance of the GW separation network. Prior studies [19, 20] have established that the accuracy of parameter estimation for the two sources can be notably influenced by both the peak time difference and the SNR difference. Our study examines how these two factors specifically affect GW separation.

Fig.4 illustrates an example of overlapping signal shapes, considering variations in peak time differences (a) and signal-to-noise ratio (SNR) differences (b). In subsequent sub-sections, we will introduce noise to these waveforms to produce simulated strain data, and then evaluate the performance of the GW separation model using this simulated data. From this figure, it is evident that, in most scenarios, the near merger and ringdown stages of signal A remain unaffected, whereas all stages of signal B appear blurred.

The following subsections will demonstrate that, despite the blurring of signal B and of the inspiral stage of signal A, in most cases the waveforms of both signal A and signal B can be accurately reconstructed.

[Figures 4-8: example overlapping waveforms for varying peak time and SNR differences; separated outputs for the eight peak-time-difference strains; statistics of separation success versus relative merger time; separated and injected waveforms for varying SNR of signal B; statistics of separation success versus SNR difference.]

IV.1 Impact of peak time difference on the GW separation

In this subsection, we elaborate on the influence of peak time disparities on GW separation. We produce the three elements constituting a single strain: noise, signal A, and signal B. The source parameters of signal A and signal B are the same as those of the waveforms shown in Fig. 4. With signal A peaking at 3.7 seconds within the strain window, we adjust the peak time of signal B to generate eight distinct waveforms, with time differences between the peaks of signal A and signal B ranging from -0.7 s to 0 s. By combining these three components, we synthesize eight unique strains. Subsequently, we feed these strains to the GW separation network and analyze the outputs. Fig. 5 displays the individual outputs corresponding to each of the eight strains. To measure the separation performance, we use the overlap between the two separated signals and the two original signals. The overlap of signals $h$ and $\widetilde{h}$ can be written as

$$\mathrm{overlap}\big(h,\tilde{h}\big) = \frac{\int h(t)\,\tilde{h}(t)\,dt}{\sqrt{\int h^{2}(t)\,dt\int \tilde{h}^{2}(t)\,dt}}. \qquad (14)$$
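In practice this overlap can be computed directly on the sampled waveforms; a small sketch, assuming the injected and separated signals are equal-length NumPy arrays, is:

```python
# Sketch of the time-domain overlap of Eq. (14) between an injected signal h
# and a separated signal h_tilde, both sampled at 4096 Hz.
import numpy as np

def overlap(h, h_tilde, dt=1.0 / 4096):
    num = np.sum(h * h_tilde) * dt
    den = np.sqrt(np.sum(h**2) * dt * np.sum(h_tilde**2) * dt)
    return num / den
```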

From Fig. 5 we can see that all eight strains have been successfully separated. Surprisingly, even in the extreme situation where the peak times of signal A and signal B coincide, the overlaps of both signal A and signal B are greater than 0.95.

Fig. 5 presents a single case study demonstrating the effect of the peak time difference on GW separation. Here, we undertake a comprehensive statistical analysis to investigate the broader influence of peak time disparities on GW separation. To this end, we generated eleven sub-test-datasets, differing only in their peak time disparities, specifically {-1.0 s, -0.9 s, -0.8 s, -0.7 s, -0.6 s, -0.5 s, -0.4 s, -0.3 s, -0.2 s, -0.1 s, 0 s}. Each of these sub-datasets comprises 1000 samples, with consistent noise and parameter distributions across all datasets. The stack plot in Fig. 6 illustrates the distribution of separated signals as a function of the relative merger time ($T_{B}-T_{A}$). Note that if the overlap between the separated signal and the actual injected signal exceeds 0.9, we consider the signal to be successfully separated. This figure reveals that in most scenarios both signal A and signal B are effectively separated. Notably, even in the most extreme circumstance, where the merger times of signal A and signal B coincide, over 80% of the samples are still accurately separated, while approximately 10% of the samples yield successful separation of only one of the two injections. Approximately 5% of the samples show unsuccessful separation of both signal A and signal B. These results further underscore the exceptional performance of our model in denoising and separating mixed signals. The model effectively distinguishes overlapped signals under different peak time difference conditions, achieving high-quality separation in the majority of cases, which highlights its robustness and capability in signal-processing tasks.

IV.2 Impact of SNR difference on the GW separation

In the preceding section, we discussed the impact of peak time differences on the separation of gravitational wave signals. In practical scenarios, the amplitudes of the individual components within the entangled signals exhibit diversity. Herein, we delve into the influence of signal strength on GW disentanglement. Signal strength can be quantified by the matched signal-to-noise ratio (SNR). To be specific, we maintain an SNR of 10 for signal A while adjusting the SNR differential between signal B and signal A in increments of 2, spanning from -4 to 10. Consequently, the SNRs for signal B are adjusted to the following values: {6, 8, 10, 12, 14, 16, 18, 20}.

We configure the parameters identically to those presented in Fig.4. Specifically, we establish the peak time of signal A at 3.7 seconds within the strain window and set the peak time of signal B at 3.5 seconds, resulting in a peak time difference of -0.2 seconds. We then adjusted the SNR of signal B, varying it from 6 to 20. After superimposing signal A, signal B, and noise, we input the combined signal into the Gravitational Wave (GW) separation network and obtained the output. Fig.7 illustrates the separated and injected waveforms for both signal A and signal B.

Here, we analyze the influence of signal B on the GW separation performance of signal A, shown in the right column of Fig. 7. As the SNR of signal B varies from 6 to 20, the separation results for signal A remain almost unchanged: all the separation overlaps of signal A are greater than 0.98.

When the SNR of signal B is 6, the overlap between the separated signal B and the buried signal is approximately 0.82. We hypothesize two primary factors that may influence the separation performance for signal B: first, the SNR of signal B is quite low, so noise interferes with the separation process; second, signal A and the noise may jointly contribute to the decrease in separation performance. To better understand the reasons behind the imperfect separation, we subtract signal A and preserve only signal B and the noise in the strain data. This modified data is then fed into the separation model to observe the impact on the separation of signal B. We find that the overlap of signal B equals 0.80, which is nearly identical to 0.82. This suggests that the underwhelming performance observed for signal B in Fig. 7(a) is unrelated to the overlapping signal A and is instead caused by the intensity of the noise.

To further investigate the impact of SNR differences on separation performance and identify potential shortcomings of our model, we prepared 1,000 samples for each SNR difference value. Fig. 8 illustrates the four separation scenarios under different SNR differences, with the x-axis representing SNR differences ranging from -4 to 10. Note that the SNR of signal A is fixed at 10, and the SNR of signal B is set to {6, 8, 10, 12, 14, 16, 18, 20}, corresponding to SNR differences of {-4, -2, 0, 2, 4, 6, 8, 10}. From the area chart in Fig. 8, it is evident that the orange region, indicating the successful separation of both signals, occupies the majority of the area, while the red region, representing scenarios where neither signal was successfully separated, remains very small. Specifically, when the SNR of signal A is fixed at 10 and the SNR of signal B is 6 or 8, the instances where only signal A is successfully separated significantly outnumber those where only signal B is successfully separated. This indicates that, when signal B is the weaker of the two, the model is more likely to successfully separate the signal with the higher SNR. These results suggest that further optimization is needed to enhance the model's performance in separating overlapping signals with low-SNR components. At the same time, they confirm the robustness of the current model in most cases.

V Conclusion

In this paper, we address the challenge posed by overlapping GW signals, an issue that will become increasingly relevant as future GW observatories come online. We have demonstrated the feasibility of adapting speech separation techniques to the domain of GW signal separation, employing deep learning models for this task. Our findings reveal that the proposed approach can effectively disentangle overlapping GW signals across a range of peak time differences. This capability supports robust signal identification and accurate extraction of individual GW events from a complex signal mixture. Additionally, we observed that the method performs remarkably well across a range of SNRs. Even in low-SNR scenarios, where noise levels are relatively high, the model demonstrates its ability to separate and identify GW signals with reasonable accuracy.

References

  • [1]Junaid Aasi, BPAbbott, Richard Abbott, Thomas Abbott, MRAbernathy, Kendall Ackley, Carl Adams, Thomas Adams, Paolo Addesso, RXAdhikari, etal.Advanced ligo.Classical and quantum gravity, 32(7):074001, 2015.
  • [2]Fetal Acernese, MAgathos, KAgatsuma, Damiano Aisa, NAllemandou, Aea Allocca, JAmarni, Pia Astone, GBalestri, GBallardin, etal.Advanced virgo: a second-generation interferometric gravitational wave detector.Classical and Quantum Gravity, 32(2):024001, 2014.
  • [3]BenjaminP Abbott, RAbbott, TDAbbott, MRAbernathy, Fausto Acernese, KAckley, CAdams, TAdams, Paolo Addesso, RXAdhikari, etal.Gw150914: The advanced ligo detectors in the era of first discoveries.Physical review letters, 116(13):131103, 2016.
  • [4]BenjaminP Abbott, Richard Abbott, TDAbbott, MRAbernathy, Fausto Acernese, KAckley, CAdams, TAdams, Paolo Addesso, RXAdhikari, etal.Properties of the binary black hole merger gw150914.Physical review letters, 116(24):241102, 2016.
  • [5]LIGO Scientific, Virgo Collaborations, BPAbbott, RAbbott, TDAbbott, MRAbernathy, FAcernese, KAckley, CAdams, TAdams, etal.Tests of general relativity with gw150914.Physical review letters, 116(22):221101, 2016.
  • [6]BenjaminP Abbott, RAbbott, TDAbbott, MRAbernathy, Fausto Acernese, KAckley, CAdams, TAdams, Paolo Addesso, RXAdhikari, etal.Gw150914: First results from the search for binary black hole coalescence with advanced ligo.Physical Review D, 93(12):122003, 2016.
  • [7]BenjaminP Abbott, Richard Abbott, ThomasD Abbott, MRAbernathy, Fausto Acernese, KAckley, CAdams, TAdams, Paolo Addesso, RanaX Adhikari, etal.Improved analysis of gw150914 using a fully spin-precessing waveform model.Physical Review X, 6(4):041014, 2016.
  • [8]BenjaminP Abbott, Richard Abbott, TDea Abbott, SAbraham, FAcernese, KAckley, CAdams, RXAdhikari, VBAdya, Christoph Affeldt, etal.Gwtc-1: a gravitational-wave transient catalog of compact binary mergers observed by ligo and virgo during the first and second observing runs.Physical Review X, 9(3):031040, 2019.
  • [9]Richard Abbott, TDAbbott, SAbraham, FAcernese, KAckley, AAdams, CAdams, RXAdhikari, VBAdya, Christoph Affeldt, etal.Gwtc-2: compact binary coalescences observed by ligo and virgo during the first half of the third observing run.Physical Review X, 11(2):021053, 2021.
  • [10]RAbbott, TDAbbott, FAcernese, KAckley, CAdams, NAdhikari, RXAdhikari, VBAdya, CAffeldt, DAgarwal, etal.Gwtc-2.1: Deep extended catalog of compact binary coalescences observed by ligo and virgo during the first half of the third observing run.Physical Review D, 109(2):022001, 2024.
  • [11]Richard Abbott, TDAbbott, FAcernese, KAckley, CAdams, NAdhikari, RXAdhikari, VBAdya, CAffeldt, DAgarwal, etal.Gwtc-3: compact binary coalescences observed by ligo and virgo during the second part of the third observing run.Physical Review X, 13(4):041039, 2023.
  • [12]MPunturo, MAbernathy, Fausto Acernese, BAllen, Nils Andersson, KArun, Fabrizio Barone, BBarr, MBarsuglia, MBeker, etal.The einstein telescope: A third-generation gravitational wave observatory.Classical and Quantum Gravity, 27(19):194002, 2010.
  • [13]SHild, MAbernathy, Fea Acernese, PAmaro-Seoane, NAndersson, KArun, FBarone, BBarr, MBarsuglia, MBeker, etal.Sensitivity studies for third-generation gravitational wave observatories.Classical and Quantum gravity, 28(9):094013, 2011.
  • [14]David Reitze, RanaX Adhikari, Stefan Ballmer, Barry Barish, Lisa Barsotti, GariLynn Billingsley, DuncanA Brown, Yanbei Chen, Dennis Coyne, Robert Eisenstein, etal.Cosmic explorer: the us contribution to gravitational-wave astronomy beyond ligo.arXiv preprint arXiv:1907.04833, 2019.
  • [15]BenjaminP Abbott, Richard Abbott, ThomasD Abbott, MatthewR Abernathy, Kendall Ackley, Carl Adams, Paolo Addesso, RanaX Adhikari, VaishaliB Adya, Christoph Affeldt, etal.Exploring the sensitivity of next generation gravitational wave detectors.Classical and Quantum Gravity, 34(4):044001, 2017.
  • [16]Tania Regimbau and ScottA Hughes.Gravitational-wave confusion background from cosmological compact binaries: Implications for future terrestrial detectors.Physical Review D, 79(6):062002, 2009.
  • [17]Yoshiaki Himemoto, Atsushi Nishizawa, and Atsushi Taruya.Impacts of overlapping gravitational-wave signals on the parameter estimation: Toward the search for cosmological backgrounds.Physical Review D, 104(4):044010, 2021.
  • [18]Anuradha Samajdar, Justin Janquart, Chris Van DenBroeck, and Tim Dietrich.Biases in parameter estimation from overlapping gravitational-wave signals in the third-generation detector era.Physical Review D, 104(4):044003, 2021.
  • [19]Philip Relton and Vivien Raymond.Parameter estimation bias from overlapping binary black hole events in second generation interferometers.Physical Review D, 104(8):084039, 2021.
  • [20]Elia Pizzati, Surabhi Sachdev, Anuradha Gupta, and BSSathyaprakash.Toward inference of overlapping gravitational-wave signals.Physical Review D, 105(10):104016, 2022.
  • [21]Yixuan Dang, Ziming Wang, Dicong Liang, and Lijing Shao.Impact of overlapping signals on parameterized post-newtonian coefficients in tests of gravity.The Astrophysical Journal, 964(2):194, 2024.
  • [22]AaronD Johnson, Katerina Chatziioannou, and WillM Farr.Source confusion from neutron star binaries in ground-based gravitational wave detectors is minimal.Physical Review D, 109(8):084015, 2024.
  • [23]Philip Relton, Andrea Virtuoso, Sophie Bini, Vivien Raymond, Ian Harry, Marco Drago, Claudia Lazzaro, Andrea Miani, and Shubhanshu Tiwari.Addressing the challenges of detecting time-overlapping compact binary coalescences.Physical Review D, 106(10):104045, 2022.
  • [24]Shichao Wu and AlexanderH Nitz.Mock data study for next-generation ground-based detectors: The performance loss of matched filtering due to correlated confusion noise.Physical Review D, 107(6):063022, 2023.
  • [25]Justin Janquart, Tomasz Baka, Anuradha Samajdar, Tim Dietrich, and Chris Van DenBroeck.Analyses of overlapping gravitational wave signals using hierarchical subtraction and joint parameter estimation.Monthly Notices of the Royal Astronomical Society, 523(2):1699–1710, 2023.
  • [26]Jurriaan Langendorff, Alex Kolmus, Justin Janquart, and Chris Van DenBroeck.Normalizing flows as an avenue to studying overlapping gravitational wave signals.Physical Review Letters, 130(17):171402, 2023.
  • [27]CunLiang Ma, Sen Wang, Wei Wang, and Zhoujian Cao.Using deep learning to predict matched signal-to-noise ratio of gravitational waves.Physical Review D, 109(4):043009, 2024.
  • [28]YiLuo and Nima Mesgarani.Tasnet: time-domain audio separation network for real-time, single-channel speech separation.In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 696–700. IEEE, 2018.
  • [29]YiLuo and Nima Mesgarani.Conv-tasnet: Surpassing ideal time–frequency magnitude masking for speech separation.IEEE/ACM transactions on audio, speech, and language processing, 27(8):1256–1266, 2019.
  • [30]Yuzhou Liu and DeLiang Wang.Divide and conquer: A deep casa approach to talker-independent monaural speaker separation.IEEE/ACM Transactions on audio, speech, and language processing, 27(12):2092–2102, 2019.
  • [31]YiLuo, Cong Han, Nima Mesgarani, Enea Ceolini, and Shih-Chii Liu.Fasnet: Low-latency adaptive beamforming for multi-microphone audio processing.In 2019 IEEE automatic speech recognition and understanding workshop (ASRU), pages 260–267. IEEE, 2019.
  • [32]YiLuo, Zhuo Chen, and Takuya Yoshioka.Dual-path rnn: efficient long sequence modeling for time-domain single-channel speech separation.In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 46–50. IEEE, 2020.
  • [33]Cunhang Fan, Jianhua Tao, Bin Liu, Jiangyan Yi, Zhengqi Wen, and Xuefei Liu.Deep attention fusion feature for speech separation with end-to-end post-filter method.arXiv preprint arXiv:2003.07544, 2020.
  • [34]Jingjing Chen, Qirong Mao, and Dong Liu.Dual-path transformer network: Direct context-aware modeling for end-to-end monaural speech separation.arXiv preprint arXiv:2007.13975, 2020.
  • [35]KeTan, Buye Xu, Anurag Kumar, Eliya Nachmani, and Yossi Adi.Sagrnn: Self-attentive gated rnn for binaural speaker separation with interaural cue preservation.IEEE Signal Processing Letters, 28:26–30, 2020.
  • [36]Cem Subakan, Mirco Ravanelli, Samuele Cornell, Mirko Bronzi, and Jianyuan Zhong.Attention is all you need in speech separation.In ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 21–25. IEEE, 2021.
  • [37]Neil Zeghidour and David Grangier.Wavesplit: End-to-end speech separation by speaker clustering.IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29:2840–2849, 2021.
  • [38]Kai Li, Runxuan Yang, and Xiaolin Hu.An efficient encoder-decoder architecture with top-down attention for speech separation.arXiv preprint arXiv:2209.15200, 2022.
  • [39]Shengkui Zhao and Bin Ma.Mossformer: Pushing the performance limit of monaural speech separation using gated single-head transformer with convolution-augmented joint self-attentions.In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 1–5. IEEE, 2023.
  • [40]Samuel Pegg, Kai Li, and Xiaolin Hu.Rtfs-net: Recurrent time-frequency modelling for efficient audio-visual speech separation.arXiv preprint arXiv:2309.17189, 2023.
  • [41]Fa-Yu Wang, Chong-Yung Chi, Tsung-Han Chan, and Yue Wang.Nonnegative least-correlated component analysis for separation of dependent sources by volume maximization.IEEE transactions on pattern analysis and machine intelligence, 32(5):875–888, 2009.
  • [42]ChrisHQ Ding, Tao Li, and MichaelI Jordan.Convex and semi-nonnegative matrix factorizations.IEEE transactions on pattern analysis and machine intelligence, 32(1):45–55, 2008.
  • [43]Daniel George and EliuAntonio Huerta.Deep learning for real-time gravitational wave detection and parameter estimation: Results with advanced ligo data.Physics Letters B, 778:64–70, 2018.
  • [44]Hunter Gabbard, Michael Williams, Fergus Hayes, and Chris Messenger.Matching matched filtering with deep networks for gravitational-wave astronomy.Physical review letters, 120(14):141103, 2018.