Jiacheng DingSchool of Physics and Astronomy, Sun Yat-Sen University,
No.2 Daxue Road, Zhuhai 519082, ChinaXin WangSchool of Physics and Astronomy, Sun Yat-Sen University,
No.2 Daxue Road, Zhuhai 519082, ChinaCSST Science Center for the Guangdong-Hong Kong-Macau Greater Bay Area, SYSU, ChinaUe-Li PenInstitute of Astronomy and Astrophysics, Academia Sinica, Astronomy-Mathematics Building,
No.1, Sec.4, Roosevelt Road, Taipei 10617, TaiwanCanadian Institute for Theoretical Astrophysics, 60 St. George Street, Toronto, ON M5S 3H8, CanadaCanadian Institute for Advanced Research, 180 Dundas St West, Toronto, ON M5G 1Z8, CanadaDunlap Institute for Astronomy and Astrophysics, University of Toronto,
50 St George Street, Toronto, ON M5S 3H4, CanadaPerimeter Institute of Theoretical Physics, 31 Caroline Street North, Waterloo, ON N2L 2Y5, CanadaXiao-Dong LiSchool of Physics and Astronomy, Sun Yat-Sen University,
No.2 Daxue Road, Zhuhai 519082, ChinaCSST Science Center for the Guangdong-Hong Kong-Macau Greater Bay Area, SYSU, China
Abstract
Foreground removal presents a significant obstacle in both current and forthcoming intensity mapping surveys. While numerous techniques have been developed that show promise in simulated datasets, their efficacy often diminishes when applied to real-world data. A primary issue is the frequency-dependent variations in the instrumental response. In this paper, we propose a novel approach utilizing the internal cross-correlation among different frequencies to calibrate the beam’s frequency fluctuations. Using a simulated dataset that incorporates frequency-dependent random fluctuations into the beam model, we illustrate that our method can achieve considerable improvements over traditional techniques. Our results represent a step forward in enhancing the precision and reliability of foreground removal in intensity mapping surveys.
cosmology, 21cm intensity mapping
††software: CAMB(Lewis etal., 2000),NumPy(Van DerWalt etal., 2011; Harris etal., 2020),SciPy(Virtanen etal., 2020),Astropy(AstropyCollaboration etal., 2013, 2018, 2022),HEALPix/healpy(Górski etal., 2005; Zonca etal., 2019)mpi4py(Dalcin & Fang, 2021),Matplotlib(Hunter, 2007),OpenMPI(Gabriel etal., 2004)1 Introduction
The large-scale structure of the Universe holds invaluable information, providing insights into various aspects such as galaxy evolution, fundamental physics including dark matter, dark energy, alternative theories of gravity, and the early Universe, etc. For over four decades, galaxy surveys have played an important role in enhancing our understanding of the Universe, including the 2dF Galaxy Redshift Survey111http://www.2dfgrs.net/ Colless etal. (2001), the 6dF Galaxy Survey222http://www.6dfgs.net/Jones etal. (2004), the WiggleZ Dark Energy Survey333https://wigglez.swin.edu.au/ Drinkwater etal. (2010), and SDSS-III BOSS444https://www.sdss3.org/Anderson etal. (2014), DES555https://www.darkenergysurvey.org/Collaboration: etal. (2016), DESI666https://www.desi.lbl.gov/Collaboration etal. (2023), and LSST777https://www.lsst.org/Ivezić etal. (2019) in the future. In recent years, 21cm intensity mapping has emerged as a alternative technique to probe the large-scale structure (Scott & Rees, 1990). This method, focusing on the 21cm line emission from neutral hydrogen (HI), differs from conventional galaxy surveys that observe individual galaxies. Instead, 21cm intensity mapping captures the collective emission from multiple galaxies, mapping the intensity fluctuation without resolving individual galaxies. This efficient approach enables us to rapidly map matter distribution over large volumes of Universe. Significant achievements have been made in this area over the past decade. These include the successful cross-correlation of 21cm maps with galaxy samples(Masui etal., 2013; Wolz etal., 2022; Cunnington etal., 2023) and the measurement of the auto-power spectrum using interferometer(Abdurashidova etal., 2022; Paul etal., 2023). Consequently, numerous ongoing surveys and forthcoming next-generation radio telescopes (SKA888https://www.skao.int/en/, HERA999https://reionization.org/(DeBoer etal., 2017)) are poised to significantly enhance the detection capabilities, and will eventually lead to cosmological constraints using this technique.
Despite these advancements, significant challenges remain, particularly with foreground contamination. Being several orders of magnitude brighter than the cosmological signal, these foregrounds substantially hinder progress in auto-correlation measurements, especially when compared to cross-correlation with galaxy surveys – a milestone achieved over a decade ago. Recent breakthrough in detecting the auto-power spectrum using the MeerKAT interferometry mode and foreground avoidance techniques(Paul etal., 2023) marks significant progress. However, the long baselines of the MeerKAT telescope are confined to observing only smaller scales, which limits their ability in deriving meaningful cosmological constraints.
Many different techniques have been devised to overcome this challenge, including methods like spectral fitting, Principal Component Analysis (PCA) (Masui etal., 2013; Switzer etal., 2013, 2015; Bigot-Sazy etal., 2015), Independent Component Analysis (ICA) (Chapman etal., 2012; Wolz etal., 2014), and the Karhunen-Loève transform (KL) (Bégin etal., 2022) and other machine learning based methods (Mangena etal., 2020; Makinen etal., 2021; Wadekar etal., 2021; Villanueva-Domingo &Villaescusa-Navarro, 2021) etc. However, while many of these methods demonstrate reasonable effectiveness in extracting signals from idealized, simulated datasets, real-world complexities often hinder their practical application.
One complexity arises from the chromaticity of the instrument. In interferometric observations, the chromatic response causes a mixing of low-wavelength foreground modes with small-scale HI signal within a wedge-like region, further restricting the Fourier space where foreground avoidance techniques can be applied. While this effect is less a issue in telescopes operating in single-dish mode, chromaticity can still be a significant factor. This is largely due to the inherent imperfections in any instrument we construct, which introduce a multiplicative, frequency-dependent noise into the response beam. The convolution of this noisy beam with the observed sky can significantly degrade the effectiveness of most, if not all, existing foreground removal methods. This includes techniques ranging from blind PCA/ICA and parametric fitting to others that necessitate knowledge of the instrumental response, such as the Karhunen-Loève m-modes and various machine learning-based methods.
At the first glance, the most straightforward solution to mitigate beam variation appears to be simply improving the calibration or measurement accuracy of the beam. Indeed, this approach has been widely pursued, with significant efforts dedicated to improving beam understanding through methods such as drone measurement (Chang etal., 2015) and holographic measurements (Hunter etal., 2011; Iheanetu etal., 2019). In this paper, however, we introduce a novel approach to address this issue. Our core concept is that it is the relative beam differences across frequencies that primarily influence the success of foreground removal. An unknown, frequency-independent deviation in the beam only introduces an additional window function to the detected signal, with minimal impact on the efficiency of the foreground removal process. Therefore, we propose a Correlation-Based Beam Calibration method (CBC), designed to internally calibrate the frequency variations among different frequency channels.
The structure of this paper is as follows: Section II provides a concise overview of our method, outlining the fundamental concept. Section III delves into the detailed methodology. Section IV describes the process of generating the simulated data, followed by the presentation of our results in Section V. The paper concludes with a discussion in Section VI. In this paper, we assume a fiducial cosmology with parameters of (, , , , , , )
2 Instrumental Chromaticity and Foreground Removal
To demonstrate the fundamental principle of our method, let us begin by presuming that the observed data have undergone processing up to the map-making stage, yielding an intensity map. This map, represented as the temperature , is a function of the line-of-sight (LoS) direction and the frequency , which is composed of foregrounds and cosmological signal
(1) |
To proceed, we make the basic assumption that the observed sky map is a convolution of the true sky with a frequency-dependent power beam
(2) |
where represents the observed sky temperature, with the tilde indicating the quantity as post-instrumentally ‘observed’.
As will be seen, this assumption is essential here, as our methodology relies on filtering within the Fourier space. For many radio telescopes, this assumption holds reasonably true, although the observational strategy or subsequent data analysis procedures might introduce some position-dependent response functions. A case in point is the 19-beam L-band receivers of the Five-hundred-meter Aperture Spherical radio Telescope (FAST). In its drift scan mode (Zhang etal., 2019; Hu etal., 2021; Li etal., 2023), disparate sky regions may be observed by different subsets of these 19 beams. Consequently, without proper compensation during the map-making process, any disparities among the receivers might lead to a violation of our assumption.
In the Fourier space, Equation (2) is expressed as
(3) |
where is the Fourier conjugate of sky coordinates . Here is the speed of light and is the spatial separation between elements of the instrument. In an ideal radio telescope observation, the beam function scales linearly with frequency. In Fourier space, this implies that the dependency on is expressed through the relation
(4) |
In reality, no instrument is flawless, and these imperfections could significantly influence the performance of foreground removal techniques. Many existing methods for foreground subtraction rely on having at least some knowledge of the beam characteristics. An imperfect or noisy telescope beam can compromise the accuracy of these techniques. Here, the ‘imperfect beam’ refers to any deviation from the ideal antenna response pattern, which can stem from a range of issues including manufacturing inaccuracies, mechanical structural deformations of the antenna, errors in feed positioning, and frequency-dependent responses of electronic components, etc. Environmental conditions such as temperature fluctuations and wind can induce additional deformations. While these factors indeed influence the beam, their temporal variability also challenges our assumption of a consistent convolution process. As such, they fall outside the scope of this paper.
To gain a clearer understanding of how frequency-dependent variations in the beam response can cause distortion and potential intermixing of foreground and cosmological signals, it is instructive to examine specific cleaning techniques. For our analysis, existing foreground removal methods can be divided into two categories. The first category includes methods that explicitly require knowledge of the beam function. This group encompasses forward-modeling methods (Bernardi etal., 2011; Sullivan etal., 2012), the m-mode formalism which relies on the input beam function to construct a signal-to-contaminant-ratio ordered Karhunen–Loève basis (Shaw etal., 2014), and various semi-blind methods that assume prior knowledge of both the foreground and the instrument (Zuo etal., 2023). Additionally, this category includes machine learning-based approaches (Gillet etal., 2019; LaPlante & Ntampaka, 2019; Makinen etal., 2021; Ni etal., 2022; Gheller & Vazza, 2022; Shi etal., 2023). For these methods, a precise understanding of the instrumental response is essential. The second category includes methods that do not explicitly require an instrumental response. However, they can still be significantly affected by a noisy beam. For instance, some techniques that rely on the spectrum’s smoothness, such as parametric and certain non-parametric fitting methods, are particularly vulnerable. A noisy beam disrupts the underlying assumption of smoothness, rendering these techniques less effective or inapplicable.
Another type of method within this category is the blind methods, notably PCA and ICA, as detailed in various studies (Masui etal., 2013; Alonso etal., 2015; Bigot-Sazy etal., 2015; Stone, 2002; Alonso etal., 2015; Tharwat, 2021). These techniques are generally more adept at handling instrumental systematics than parametric fitting approaches and are widely employed in real data analysis, making them the primary focus of our paper. In these methods, the data cube is interpreted as a combination of the foregrounds , the 21 cm signal , and instrumental noise . Here, is expressed as a two-dimensional matrix in the notation , where is the number of frequency channels and the number of pixels. The goal is to derive a cleaned map by excluding the foreground elements from the observed data. This is achieved as follows:
(5) |
where represents the components matrix, which could be either the most dominant modes (PCA) or the most non-Gaussian modes (ICA). The first -components are selected with matrix , signifying approximate foregrounds modeling as . However, chromatic fluctuations in the instrument may modify the decomposed modes, leading to a confusion between the 21cm signal and beam fluctuations, thereby reducing the efficacy of these blind methods. To illustrate this, consider a simplified noisy beam model. In this model, three-dimensional random noise is added in the frequency- space to a Gaussian beam :
(6) |
where represents a uniformly distributed random number. The specifics of this model are further elaborated in section 4.2.
In Figure (1), we illustrate the impact of a noisy beam on PCA foreground removal. The upper panel illustrates the eigenvalues resulting from the SVD decomposition, where a rapid decrease in eigenvalues indicates that the spectral-smooth foreground can be effectively captured by the initial few modes. As shown, with a frequency-independent beam (marked by red squares), the first four eigen-modes sufficiently describe the foregrounds over six orders of magnitude. This sets the baseline result for the most ideal scenario. Similarly, a frequency rescaling of the Gaussian beam (see Eq. 4) does not significantly alter the results (shown as red diamons). More crucially, a frequency-independent noisy beam, where varies only in the plane but not in the frequency direction (indicated by blue crosses), aligns closely with our ideal beam model. This observation is further supported in the lower panel, which presents the cross-correlation ratio between the true cosmological signal and the ‘cleaned’ map after the subtraction of three SVD modes.
Conversely, with a increasing level of the frequency-dependent beam noise , the eigenvalues descend more gradually, indicating a more complex data structure. As a result, more modes must be removed to reduce the map amplitude, leading to a more significant loss of the signal. At the same time, the cross-correlation ratio between the cleaned map (with three subtracted SVD modes) and the HI map starts to decline, a consequence of increased residual power. Notably, even a beam noise level of can substantially reduce the cross-correlation ratio, dropping it from near to about . This highlights the substantial impact of beam noise on effective foreground removal.
3 Correlation-based Beam Calibration
The above discussion clearly demonstrates that a frequency-independent fluctuations in the beam pattern have a minimal impact on foreground cleaning. This observation highlights the importance of understanding the relative beam variation across frequencies, rather than focusing solely on accurately measuring its absolute pattern. We would like to remind readers that this concept is not entirely new. The traditional SVD method (denoted as ‘TM’) has previously sought to convolve the observed dataset with a common beam function, albeit under the assumption that the beam is well described by a Gaussian function (Masui etal., 2013)
(7) | |||||
Here represents the size of the Gaussian beam at frequency , with being the beam width at the reference frequency which is also the maximum value of . The scaling factor, , is selected from a range of values, such as in MeerKAT study (Cunnington etal., 2023) or in GBT measurement (Condon & Ransom, 2016), to further smooth out any noise arising from this process. It becomes evident that applying the filter to the observed map results in a map convolved with the same Gaussian beam at all frequencies
(8) | |||||
Subsequently, in this section, we will expand on this concept and introduce our correlation-based approach, designed to effectively calibrate for the frequency-dependent variations in the beam pattern.
3.1 Frequency-dependent Beam Fluctuation
Generally, a realistic beam function typically has a complex functional form, showing variations in both and . As demonstrated in the previous section, the efficacy of foreground removal depends on the beam’s relative response. This prompts the decomposition of the beam into two distinct components:
(9) |
where denotes the frequency-independent part, aligned with the beam at the reference frequency, i.e., . On the other hand, the frequency-dependent relative beam is defined as . In the following, the primary objective of our method is to determine the relative beam with existing data.
To achieve this, we aim to develop a filtering function designed to uniformize the beam response across all frequencies
(10) |
Consequently, applying this filter to the observed map
results in a map that is convolved with the same reference beam at every frequencies.
3.2 Correlation-based Beam Calibration
Therefore, the core idea of our algorithm revolves around obtaining the relative beam through internal calibration.First, let us re-express the temperature map at a given frequency and cell as the product of the frequency-dependent relative beam and the remaining component
(12) |
Here we have ignored the noise term and absorbed the frequency-independent beam in the temperature map and define
(13) |
To proceed, we aim to construct a filter denoted as . Applying this filter to observed temperature at each frequency channel effectively inverts the relative beam, yielding an estimator for
(14) |
Consequently, our goal is to design the filter so that it functions similarly to the relative filter as described in Eq.(10).This could be achieved via the minimal variance estimator, derived by examining the variance
(15) |
where presents the averaging within specific grid cell in . Minimizing leads to the well-known Wiener filter (WF) 101010hence this explains the meaning of the superscript in Eq. (14)
(16) |
Here, we have introduced the correlation function , defined as
(17) |
Substituting Eq.(3) of the observed temperature map and temporarily neglecting the noise matrix , we obtain
(18) | |||||
It is apparent that
(19) |
where the additional intrinsic temperature factor is defined as
(20) |
It is apparent that if we ignore the frequency variation of the temperature map, causing the factor to become unity, the constructed filter simply represents the average value of the relative beam across each given cell. Therefore, is indeed the filter that meets our requirements.However, in reality, the image of the foreground varies with frequency, thus introducing an additional undetermined function on top of the relative beam. Various strategies can be employed to mitigate its influence, including using a foreground model as prior information. As we will discuss in detail in Section 5, the impact of this term is not significant.
Furthermore, observational data contains instrumental noise (Equation 3), making the filter described in Equation (19) usually sub-optimal for analyzing real data.Various methods can be adopted to effectively isolate the beam variations in such situation. We will explore these approaches in more detail and discuss their applications to real data analysis in a separate paper.In this work, however, it is worth mentioning that a straightforward solution involves considering the full correlation matrix across all frequencies
(21) |
Here the observed temperature map is a combination of sky and noise . Assuming that the amplitude of the sky is much larger than instrumental noise, we can then perform the SVD of matrix
(22) |
where the diagonal matrix store all eigen-values in decending order, and (and ) lists the corresponding eigen-vectors.As long as the sky, primarily comprised of the foreground, dominates the matrix , one could approximate the observed temperature with the value of the first eigen-mode.Therefore, in a similar approach to constructing the filter in Eq.(19), we can define the filter using only the first eigenvector , which can be expressed as:
(23) |
The only underlying assumption here is the dominance of the sky temperature relative to the instrumental noise, which is achievable with sufficient integration time.Consequently, Eq. (23), representing the ratio between the first eigen-modes at two different frequencies, characterizes the desired filter with reduced noise influence.Therefore, this approach has the advantage of requiring minimal information about the noise covariance matrix.
4 Simulation Data
In this section, we will provide a detailed overview of the technical specifics of our simulated data set, covering both the sky and beam models.
4.1 Sky Model
For simplicity, the sky model for our simulation is generated using the publicly available code Alonso etal. (2014) 111111http://intensitymapping.physics.ox.ac.uk/CRIME.html,which includes five distinct components:(a) HI signal,(b) Galactic synchrotron emission,(c) Galactic free-free emission,(d) Extra-galactic free-free emission,(e) Extra-galactic point sources emission.As demonstrated in Alonso etal. (2014), the HI signal here is derived from the lognormal model (Coles & Jones, 1991) of matter fluctuation , which is then transformed into a map of 21cm brightness temperature fluctuations
(24) |
where represents the HI overdensity, smoothed over the volume element defined by and . Here the HI fraction is assumed to be fixed. Although a more complex modeling approach might be beneficial, it is not critical for the purposes of our study. In this paper, we consider the frequency range between corresponding to the redshift range in our choice of cosmology.
In terms of foreground modeling, models the large-scale Galactic synchrotron radiation (b) by extrapolating the Haslam map(Haslam etal., 1982). This extrapolation utilizes a direction-dependent spectral index derived from the Planck Sky model(Delabrouille etal., 2013), formulated as:
(25) |
where is the synchrotron spectral index, varying with the line-of-sight (LoS) direction . To capture structures on scales smaller than Haslam resolution, incorporates a constrained Gaussian realization with parameters from (Santos etal., 2005; Alonso etal., 2014). Similarly, for the Galactic free-free (c) and Extra-galactic point sources (e) emissions, also uses Gaussian realizations based on specific angular power spectra (Alonso etal., 2014).
From the full-sky simulated maps, we then cutout a mock data cube measuring in angular direction and spanning a frequency range of . Due to computational constraints, the angular resolution of the maps is set at , corresponding to in the format.
As illustrated in Figure (2), we have chosen a relatively clean region with right ascension and declination . In the frequency domain, we have divided the data into 128 frequency channels spanning the range from to . Subsequently, we proceed to grid this dataset into a three-dimensional cube with dimensions . This raw data cube will then be convolved with beam functions, as detailed in the following subsection, Sec.4.2.
Furthermore, to evaluate the impact of the intrinsic frequency variations in the sky (as described in Eq. 20), we consider a simplified foreground model in which the spatial patterns at a reference frequency are held fixed, while the overall amplitudes are adjusted according to the respective frequency. In the following, this model is labeled to as “”, whereas the original foreground model is denoted as “”. In Figure (3), we display the map of temperature ratio between two frequencies: at and the reference frequency . The left panel displays the original sky map, while the right panel showcases the simplified model. In the simplified model, we observe some noise around the value of , which is attributed to the independently added HI signal.
4.2 Beam Model
In this subsection, we introduce the noisy beam model utilized in this study.For simplicity, we consider a single dish telescope with a Gaussian model as the unperturbed beam
(26) |
with . The full-width at half-maximum (FWHM) is determined by
(27) |
where is the diameter of the telescope dish. In many instances, this model offers a sufficiently accurate approximation. For example, as demonstrated in Liao etal. (2016), it provides a robust first-order approximation for both the intensity and polarization of the Green Bank Telescope.
As demonstrated in section 3, our method is implemented in Fourier space. In this domain, the Gaussian beam is represented as
(28) |
In the following, we focus on a single dish telescope resembling the Green Bank Telescope (GBT) with a -meter diameter, corresponding to the angular resolution that varies from to across the frequency range .
In this paper we consider two simple models of beam fluctuation. The first model (Model-I) treats the beam fluctuation as a three-dimensional random field in the space,
where is the beam noise, and is a zero mean random field in space, defined as
(30) |
Here, is the amplitude coefficient of the noise, and is a uniformly distributed random variable within the range of to . Given the entirely random nature of the noises across frequencies, this model represents a pessimistic scenario regarding potential beam noise.
In our second model of beam noise (Model-II), we consider a more realistic scenario where the noise originates within the telescope’s spatial configuration. This noise manifests as a two-dimensional random field
(31) |
where represents the distance within the telescope’s frame. This noise field is subsequently rescaled and projected onto the space at various frequencies using the relation
(32) |
In Figure (4), present our noisy beam model for a meter single-dish telescope, showcasing in both Fourier space (panels a-d) and configuration space (panel e). Panels (a) and (b) illustrate the noisy beam at and its noise component , respectively. The resolution of in Fourier space is about , as shown in panel (b), with positive values indicated in red and negative values in blue. Model-I is characterized by completely random fluctuations in the frequency direction. This is exemplified in panel (c), which depicts a slice of the one-dimensional beam as a function of frequency. Panel (d) displays the one-dimensional beam at a specific frequency () under various noise levels , while keeping the underlying random field constant. Here, suggests that the beam fluctuates within a range of to . Finally, panel (e) illustrates the noisy side-lobes in configuration space, resulting from the beam noise . The normalization chosen here ensures at the center. We notice that these side-lobes in configuration space, emerged from simple Fourier space random noise, do resemble features in actual telescope beams, highlighting the practicality of our beam noise model.
In Figure (5), we showcase the one-dimensional beam fluctuation in Fourier space, with different colors representing three distinct frequency channels around . The upper panel illustrates our ‘pessimistic’ Model-I with , which is the three-dimensional random noise model, described by Eq. (30). In this model, the beam fluctuations are completely independent across frequencies, indicating a potentially larger impact on foreground removal. The lower panel shows a similar plot for Model-II with , where the noise resides in the telescope’s spatial domain. In this model, the frequency-dependence of the noise arises from the chromatic relationship described in Eq. (32). From the plot, it is clear that the noise profiles appear similar at lower values for nearby frequencies, with deviations only becoming noticeable at higher values.Our choice of here is consistent with real-world beam fluctuations. For instance, Liao etal. (2016) measured the beam pattern of the Green Bank Telescope (GBT) using astrophysical sources such as quasars and pulsars, characterized by two-dimensional Gauss-Hermite coefficients.Using their beam model, we estimate that the root-mean-square (RMS) of the beam fluctuation along the frequency direction could reach as high as .
4.3 Mock Observation
After convolving our simulated datacube with the noisy beam model, we derive the mock observation data in Fourier space
(33) |
as well as the corresponding map in real space .
Figure (6) offers a comparative view of the mock observation data against the raw temperature map. The upper panels show the raw sky map (upper-left), denoted as , alongside its convolution with the noisy telescope beam (upper-right), under beam Model-I (Eq.4.2), at a frequency of . In the middle-left panel, we illustrate the difference between these two plots, effectively highlighting the temperature variations caused by the beam noise of the telescope. Similarly, the middle-right panel displays the HI variation caused by the beam noise. Finally, the bottom part of the figure demonstrates the temperature fluctuation, , along the frequency axis. It is evident from this display that such frequency variations will significantly challenge the effectiveness of foreground subtraction methods.
5 Results
Before proceeding to foreground subtraction and statistical analysis, our CBC method constructs a frequency-dependent filter . As detailed in (3), this filter is generated through the cross-correlation of maps in the space, comparing each frequency with a reference frequency . This section aims to demonstrate the effectiveness of this approach. We will first discuss the calibration of the beam, followed by showcasing the results achieved through foreground removal.
5.1 Beam Calibration
In an ideal scenario, after applying the filter , observational maps at different frequencies should all be convolved with the same beam, specifically the one at the reference frequency . However, in practice, intrinsic temperature variations across frequencies complicate this process, as they introduce an additional temperature factor (Eq. 19). To dissect the separate impacts of these two factors, we also consider a simplified ‘’ foreground model. In this model, the temperature factor is held constant in the space. Meanwhile, the -dependent overall temperature amplitude of each channel is adjusted to aligns with that of the actual foreground model.
In the upper panel of Figure (7), we illustrate a one-dimensional slice of the filter along with its individual components. Here, the sky temperature ratio remains constant (blue dash-dotted line) at a value of approximately . Consequently, the normalized constructed filter (red solid line) closely resembles the relative beam (dotted line) that we introduced in the beam model. This convincingly validates the efficacy of our method.
In the lower panel of Figure (7), we present the filter for the more realistic ‘’ foreground. In contrast to the ‘’ model, the intrinsic variation in sky temperature (blue dash-dotted line) has a substantial influence on the estimation of the relative beam.As depicted, the fluctuations in the temperature ratio are considerably larger than the beam noise we introduced (black dashed line). As a result, the constructed filter (red solid line) bears a closer resemblance to the temperature ratio than the relative beam.However, this extra factor actually does not invalidate our method. In fact, as we will demonstrate in the next subsection, the foreground-cleaned map following the aforementioned procedure still exhibits considerable improvements compared to traditional methods. This is primarily because the variation in sky temperature, as shown in Figure (8), changes gradually along the frequency axis. In contrast, while the beam noise we introduced might appear small, it is either completely random (beam Model-I) or somewhat random (Model-II) along frequency. This more pronounced frequency dependence has a much more impactful influence on the result of foreground removal.
5.2 Foreground Removal and Statistical Measurements
After estimating and applying the CBC filter to our mock data, we now proceed with foreground removal and statistical analysis. Specifically, we adopt the blind Principal Component Analysis (PCA) cleaning technique following the methods detailed in Masui etal. (2013) and Switzer etal. (2013).We begin by representing the temperature map as a matrix , with dimensions . The PCA modes are then obtained through the decomposition of , where is a diagonal matrix arranged in descending order. The ‘cleaned’ map, obtained by subtracting the first PCA modes, is calculated using Eq. (5).We continue by calculating the auto-power spectrum of the cleaned map
(34) |
Additionally, we also compute the cross-power spectrum with the injected true HI signal. Specifically, our interest lies in the cross-correlation ratio between the two fields, defined as
(35) |
Here represents the injected HI signal and is the cross-power spectrum between the signal and the cleaned map.
As detailed in Section 4.1, our mock observational data encompass the region spanning from to in frequency, to in right ascension (RA), and to in declination (DEC). With our chosen cosmological model, this region translate into a comoving box with dimensions of .To assess the effectiveness of our method, we compare the outcomes achieved with the CBC approach to those obtained using a traditional method.The latter involves convolving data with a common Gaussian beam. This traditional approach, as described in Masui etal. (2013), includes the convolution with an additional kernel, equivalent to Eq. (7). In this paper, we apply this filter in real space.Following the discussion in Section 4.3, we will conduct a comprehensive comparison that incorporates two foreground models: the simplified ‘’ model and the realistic ‘’ foregrounds, two beam noise models (Model I and II), and two methods of beam calibration: the traditional method (TM) and our own correlation-based calibration method (CBC).
To establish a baseline result, Figure (9) presents both the auto power spectrum of the cleaned map (upper panels) and the cross-correlation ratio with true cosmic signal (lower panels), assuming that the relative beam is precisely known. As expected, both metrics agree with the expectation of an ‘ideal’ foreground removal, with only a few modes’ subtraction resulting in a low auto-power spectrum and a cross-correlation ratio nearing unity. This observation remains consistent across different foreground models and beam calibration methods. However, the results from the simplified ‘’ (left panels) model demonstrate better performance. Particularly, subtracting as few as modes (blue-dashed line) is sufficient to remove this simple foreground contamination. Furthermore, as shown in the cross-correlation, the more realistic foreground model tends to result in an over-subtraction of the signal at the large-scale (). Notice that we did not differentiate between two beam models in this context, as they yield very similar results when the ‘true knowledge’ of beam fluctuations is known.
We then turn to assessing the efficacy of our CBC method against the traditional approach of the Gaussian common beam regularization. Figure (10) illustrates the result for the three-dimensional random beam model, i.e. Model-I, with noise amplitude .As shown, our CBC method consistently outperforms the traditional approach, regardless of the number of subtracted modes (), or the choice of foreground models. For each chosen value of , the CBC method significantly reduces the amplitude of the auto-power spectrum by a factor of several and enhances the cross-correlation ratio by approximately . These outcomes suggest that, with such a noisy beam, the CBC method excels in removing foregrounds more effectively and in preserving the HI signals more efficiently.
For both methods, a sudden shape change appears in the auto-power spectrum around , resulting in a bump in the cross-correlation ratio . This signature is the residual foreground caused by the mixing of beam noise with the intrinsic structure in the foreground. Such foreground structure can be partially observed in Figure 7. In the traditional method, we observe that the cross-correlation ratio undergoes slightly more degradation than in the CBC method, especially at larger scales (). Additionally, the performance impact when using the realistic ‘’ foreground model is surprisingly insignificant compared to the simpler ‘’ model. This can be partly attributed to the slow frequency variation of the temperature ratio . Despite the significant spatial variation of relative to the beam noise, as illustrated in Figure (7), its frequency dependence is actually quite smooth. This is evident in Figure (8), where the right panel shows a different but slow frequency change across various spatial values.
To examine how the performance of CBC method varies with the noise levels of the beam, we also analyze the results for two different amplitude, , in Model-I. Figure 11 and 12 illustrate the outcomes for and , respectively. For smaller beam noise (), we are able to subtract fewer modes to achieve a similar level of the cross-power spectrum, and the auto-power spectrum also approaches more closely to the injected signal for both the traditional and CBC methods. Although the CBC method continues to outperform the traditional approach, the correlation ratio for the CBC method stabilizes around at large , compared to for the traditional method. This indicates a reduced improvement factor of about one-third.In contrast, with larger noise (), greater residual foregrounds lead to a higher auto-power spectrum and a lower cross-correlation ratio for both methods. Regardless, the CBC method still improves the cross-correlation ratio by approximately , which is a similar level of improvement compared to our fiducial model with .
Figure 13 displays the results for beam Model-II with a noise level of . The higher noise amplitude compared to Model-I is due to the significantly reduced frequency dependence of the noise in Model-II, as evidenced in Figure 5.A notable distinction from Figure 10 is the increased over-subtraction at larger scales (), observed in both the traditional approach and the CBC method, regardless of the foreground model considered. This difference arises because, in this beam model, the two-dimensional noise is rescaled along the frequency direction with , making the frequency dependence of the noise also scale-dependent. This behavior is apparent in Figure 5, where at lower values, corresponding to lower and shorter distance , the dependency on is less pronounced than at higher values. Such a condition leads to a unique interplay between the scales and the frequency of fluctuations. As a result, the SVD modes, derived from the frequency singular modes, do not perform uniformly across all scales. The maps produced are smoother in at larger angular scales, leading to over-removal in the SVD-cleaned maps. Despite this distinctive characteristic, our CBC method continues to outperform the traditional approach. As one can see, the level of improvement observed in Figure 13 is similar to what is observed in Figure 10.
We also examine the performance of beam Model-II at various noise levels. The results are illustrated in Figures 14 and 15 for and , respectively. The overall feature of over-subtracted large scale modes persists in both situations. Similar to Model-I, the performance difference between CBC and the traditional method is much smaller for the low-noise beam model, with the improvement in the cross-correlation ratio ranging from ten percent to a few percent. Conversely, for a noise level of , the improvement reaches approximately one-third.
6 Discussion and Conclusions
Foreground removal plays a pivotal role in the analysis of 21cm intensity mapping data. Most existing techniques require at least a basic understanding of the foreground, the instrument, or both. Even in blind methods, such as Principal Component Analysis (PCA), the specifics of the instrument significantly influence the efficiency of the results. Therefore, it is not surprising that these methods often show diminished performance in real-world applications, where actual instrumental beams are frequently noisy. Considerable efforts have been made towards accurate beam modeling and measurement. In this paper, we introduce a novel method that utilizing internal cross-correlation between different frequencies to calibrate the beam’s variation with frequency.
To simulate the real-world beam fluctuations, we consider two specific models of beam noise. The first is a random noise in the three-dimensional space, while the second is a two-dimensional random noise in the instrumental space, which is then projected along the frequency axis with varying coordinates. Interestingly, the performances of these two models are somewhat distinct and complementary. Beam Model-II, in particular, experiences an over-subtraction at larger scales. However, when compared with traditional methods that overlook beam variation, our approach shows a considerable improvement. It achieves an approximate increase in the cross-correlation ratio and a significant reduction in the auto-power spectrum by several factors.
The filter constructed using our method incorporates two main factors: the relative change in the beam between two frequencies and the intrinsic temperature variation. To assess the impact of the later, we consider a simplified foreground model where the spatial pattern of the foreground remains constant. Interestingly, even substantial spatial fluctuations in the foreground temperature factor have a minimal effect on the performance. This resilience can be attributed to the smooth frequency variation of the foreground. Therefore, these findings further validate the effectiveness of our method.
Acknowledgments
X.W. is supported by the National SKA Program of China (Grants Nos. 2022SKA0110200 and 2022SKA0110202). U-L.P. receives support from Ontario Research Fund—research Excellence Program (ORF-RE), Natural Sciences and Engineering Research Council of Canada (NSERC) [funding reference number RGPIN-2019-067, CRD 523638-201, 555585-20], Canadian Institute for Advanced Research (CIFAR), Canadian Foundation for Innovation (CFI), the National Science Foundation of China (Grants No. 11929301), Simons Foundation, Thoth Technology Inc, and Alexander von Humboldt Foundation.
References
- Abdurashidova etal. (2022)Abdurashidova, Z., Aguirre, J.E., Alexander, P., etal. 2022, TheAstrophysical Journal, 925, 221
- Alonso etal. (2015)Alonso, D., Bull, P., Ferreira, P.G., & Santos, M.G. 2015, Monthly Noticesof the Royal Astronomical Society, 447, 400
- Alonso etal. (2014)Alonso, D., Ferreira, P.G., & Santos, M.G. 2014, Monthly Notices of theRoyal Astronomical Society, 444, 3183
- Anderson etal. (2014)Anderson, L., Aubourg, E., Bailey, S., etal. 2014, Monthly Notices of theRoyal Astronomical Society, 441, 24
- AstropyCollaboration etal. (2018)AstropyCollaboration, Price-Whelan, A.M., Sipőcz, B., Günther, H.,etal. 2018, The Astronomical Journal, 156, 123
- AstropyCollaboration etal. (2022)AstropyCollaboration, Price-Whelan, A.M., Lim, P.L., Earl, N., etal.2022, The Astrophysical Journal, 935, 167
- AstropyCollaboration etal. (2013)AstropyCollaboration, Robitaille, T.P., Tollerud, E.J., Greenfield, P.,etal. 2013, Astronomy & Astrophysics, 558, A33
- Bégin etal. (2022)Bégin, J.-M., Liu, A., & Gorce, A. 2022, Phys. Rev. D, 105, 083503,doi:10.1103/PhysRevD.105.083503
- Bernardi etal. (2011)Bernardi, G., Mitchell, D.A., Ord, S.M., etal. 2011, Monthly Noticesof the Royal Astronomical Society, 413, 411,doi:10.1111/j.1365-2966.2010.18145.x
- Bigot-Sazy etal. (2015)Bigot-Sazy, M.-A., Dickinson, C., Battye, R.A., etal. 2015, Monthly Noticesof the Royal Astronomical Society, 454, 3240
- Chang etal. (2015)Chang, C., Monstein, C., Refregier, A., etal. 2015, Publications of theAstronomical Society of the Pacific, 127, 1131
- Chapman etal. (2012)Chapman, E., Abdalla, F.B., Harker, G., etal. 2012, Monthly Notices of theRoyal Astronomical Society, 423, 2518
- Coles & Jones (1991)Coles, P., & Jones, B. 1991, Monthly Notices of the Royal AstronomicalSociety, 248, 1, doi:10.1093/mnras/248.1.1
- Collaboration etal. (2023)Collaboration, D., Adame, A., Aguilar, J., etal. 2023, arXiv preprintarXiv:2306.06308
- Collaboration: etal. (2016)Collaboration:, D. E.S., Abbott, T., Abdalla, F., etal. 2016, MonthlyNotices of the Royal Astronomical Society, 460, 1270
- Colless etal. (2001)Colless, M., Dalton, G., Maddox, S., etal. 2001, Monthly Notices of theRoyal Astronomical Society, 328, 1039
- Condon & Ransom (2016)Condon, J.J., & Ransom, S.M. 2016, Essential radio astronomy, Vol.2(Princeton University Press)
- Cunnington etal. (2023)Cunnington, S., Li, Y., Santos, M.G., etal. 2023, Monthly Notices of theRoyal Astronomical Society, 518, 6262
- Dalcin & Fang (2021)Dalcin, L., & Fang, Y.-L.L. 2021, Computing in Science & Engineering, 23, 47
- DeBoer etal. (2017)DeBoer, D.R., Parsons, A.R., Aguirre, J.E., etal. 2017, Publications ofthe Astronomical Society of the Pacific, 129, 045001
- Delabrouille etal. (2013)Delabrouille, J., Betoule, M., Melin, J.-B., etal. 2013, Astronomy &Astrophysics, 553, A96
- Drinkwater etal. (2010)Drinkwater, M.J., Jurek, R.J., Blake, C., etal. 2010, Monthly Notices ofthe Royal Astronomical Society, 401, 1429
- Gabriel etal. (2004)Gabriel, E., Fagg, G.E., Bosilca, G., etal. 2004, in Recent Advances inParallel Virtual Machine and Message Passing Interface: 11th European PVM/MPIUsers’ Group Meeting Budapest, Hungary, September 19-22, 2004. Proceedings11, Springer, 97–104
- Gheller & Vazza (2022)Gheller, C., & Vazza, F. 2022, Monthly Notices of the Royal AstronomicalSociety, 509, 990
- Gillet etal. (2019)Gillet, N., Mesinger, A., Greig, B., Liu, A., & Ucci, G. 2019, Monthly Noticesof the Royal Astronomical Society, 484, 282
- Górski etal. (2005)Górski, K.M., Hivon, E., Banday, A.J., etal. 2005, ApJ, 622,759, doi:10.1086/427976
- Harris etal. (2020)Harris, C.R., Millman, K.J., Van DerWalt, S.J., etal. 2020, Nature, 585,357
- Haslam etal. (1982)Haslam, C., Salter, C., Stoffel, H., & Wilson, W. 1982, Astronomy andAstrophysics Supplement Series, 47, 1
- Hu etal. (2021)Hu, W., Li, Y., Wang, Y., etal. 2021, MNRAS, 508, 2897,doi:10.1093/mnras/stab2728
- Hunter (2007)Hunter, J.D. 2007, Computing in science & engineering, 9, 90
- Hunter etal. (2011)Hunter, T.R., Schwab, F.R., White, S.D., etal. 2011, Publications of theAstronomical Society of the Pacific, 123, 1087
- Iheanetu etal. (2019)Iheanetu, K., Girard, J., Smirnov, O., etal. 2019, Monthly Notices of theRoyal Astronomical Society, 485, 4107
- Ivezić etal. (2019)Ivezić, Ž., Kahn, S.M., Tyson, J.A., etal. 2019, TheAstrophysical Journal, 873, 111
- Jones etal. (2004)Jones, D.H., Saunders, W., Colless, M., etal. 2004, Monthly Notices of theRoyal Astronomical Society, 355, 747
- LaPlante & Ntampaka (2019)LaPlante, P., & Ntampaka, M. 2019, The Astrophysical Journal, 880, 110
- Lewis etal. (2000)Lewis, A., Challinor, A., & Lasenby, A. 2000, The Astrophysical Journal, 538,473
- Li etal. (2023)Li, Y., Wang, Y., Deng, F., etal. 2023, arXiv preprint arXiv:2305.06405
- Liao etal. (2016)Liao, Y.-W., Chang, T.-C., Kuo, C.-Y., etal. 2016, ApJ, 833, 289,doi:10.3847/1538-4357/833/2/289
- Liao etal. (2016)Liao, Y.-W., Chang, T.-C., Kuo, C.-Y., etal. 2016, The AstrophysicalJournal, 833, 289, doi:10.3847/1538-4357/833/2/289
- Makinen etal. (2021)Makinen, T.L., Lancaster, L., Villaescusa-Navarro, F., etal. 2021, Journalof Cosmology and Astroparticle Physics, 2021, 081
- Mangena etal. (2020)Mangena, T., Hassan, S., & Santos, M.G. 2020, Monthly Notices of the RoyalAstronomical Society, 494, 600
- Masui etal. (2013)Masui, K., Switzer, E., Banavar, N., etal. 2013, The Astrophysical JournalLetters, 763, L20
- Ni etal. (2022)Ni, S., Li, Y., Gao, L.-Y., & Zhang, X. 2022, The Astrophysical Journal, 934,83
- Paul etal. (2023)Paul, S., Santos, M.G., Chen, Z., & Wolz, L. 2023, arXiv preprintarXiv:2301.11943
- Santos etal. (2005)Santos, M.G., Cooray, A., & Knox, L. 2005, The Astrophysical Journal, 625,575
- Scott & Rees (1990)Scott, D., & Rees, M.J. 1990, Monthly Notices of the Royal AstronomicalSociety, vol. 247, p. 510, 247, 510
- Shaw etal. (2014)Shaw, J.R., Sigurdson, K., Pen, U.-L., Stebbins, A., & Sitwell, M.2014, ApJ, 781, 57, doi:10.1088/0004-637X/781/2/57
- Shi etal. (2023)Shi, F., Chang, H., Zhang, L., etal. 2023, arXiv preprint arXiv:2310.06518
- Stone (2002)Stone, J.V. 2002, Trends in cognitive sciences, 6, 59
- Sullivan etal. (2012)Sullivan, I.S., Morales, M.F., Hazelton, B.J., etal. 2012, ApJ,759, 17, doi:10.1088/0004-637X/759/1/17
- Switzer etal. (2013)Switzer, E., Masui, K., Bandura, K., etal. 2013, Monthly Notices of theRoyal Astronomical Society: Letters, 434, L46
- Switzer etal. (2015)Switzer, E.R., Chang, T.-C., Masui, K.W., Pen, U.-L., & Voytek, T.C. 2015,The Astrophysical Journal, 815, 51
- Tharwat (2021)Tharwat, A. 2021, Applied Computing and Informatics, 17, 222
- Van DerWalt etal. (2011)Van DerWalt, S., Colbert, S.C., & Varoquaux, G. 2011, Computing in science& engineering, 13, 22
- Villanueva-Domingo &Villaescusa-Navarro (2021)Villanueva-Domingo, P., & Villaescusa-Navarro, F. 2021, The AstrophysicalJournal, 907, 44
- Virtanen etal. (2020)Virtanen, P., Gommers, R., Oliphant, T.E., etal. 2020, Nature methods, 17,261
- Wadekar etal. (2021)Wadekar, D., Villaescusa-Navarro, F., Ho, S., & Perreault-Levasseur, L. 2021,The Astrophysical Journal, 916, 42
- Wolz etal. (2014)Wolz, L., Abdalla, F., Blake, C., etal. 2014, Monthly Notices of the RoyalAstronomical Society, 441, 3271
- Wolz etal. (2022)Wolz, L., Pourtsidou, A., Masui, K.W., etal. 2022, Monthly Notices of theRoyal Astronomical Society, 510, 3495
- Zhang etal. (2019)Zhang, K., Wu, J., Li, D., etal. 2019, Science China Physics, Mechanics &Astronomy, 62, 1
- Zonca etal. (2019)Zonca, A., Singer, L., Lenz, D., etal. 2019, Journal of Open SourceSoftware, 4, 1298, doi:10.21105/joss.01298
- Zuo etal. (2023)Zuo, S., Chen, X., & Mao, Y. 2023, ApJ, 945, 38,doi:10.3847/1538-4357/acb822