ArticlePDF Available

DiFX: A Software Correlator for Very Long Baseline Interferometry Using Multiprocessor Computing Environments

Authors:

Abstract and Figures

We describe the development of an FX-style correlator for very long baseline interferometry (VLBI), implemented in software and intended to run in multiprocessor computing environments, such as large clusters of commodity machines (Beowulf clusters) or computers specifically designed for high-performance computing, such as multiprocessor shared-memory machines. We outline the scientific and practical benefits for VLBI correlation, these chiefly being due to the inherent flexibility of software and the fact that the highly parallel and scalable nature of the correlation task is well suited to a multiprocessor computing environment. We suggest scientific applications where such an approach to VLBI correlation is most suited and will give the best returns. We report detailed results from the Distributed FX (DiFX) software correlator running on the Swinburne supercomputer (a Beowulf cluster of ~300 commodity processors), including measures of the performance of the system. For example, to correlate all Stokes products for a 10 antenna array with an aggregate bandwidth of 64 MHz per station, and using typical time and frequency resolution, currently requires an order of 100 desktop-class compute nodes. Due to the effect of Moore's law on commodity computing performance, the total number and cost of compute nodes required to meet a given correlation task continues to decrease rapidly with time. We show detailed comparisons between DiFX and two existing hardware-based correlators: the Australian Long Baseline Array S2 correlator and the NRAO Very Long Baseline Array correlator. In both cases, excellent agreement was found between the correlators. Finally, we describe plans for the future operation of DiFX on the Swinburne supercomputer for both astrophysical and geodetic science.
Content may be subject to copyright.
arXiv:astro-ph/0702141v1 6 Feb 2007
DiFX: A software correlator for very long baseline interferometry
using multi-processor computing environments
A.T. Deller
1
, S.J. Tingay, M. Bailes, & C. West
2
Centre for Astrophysics and Supercomputing, Swinburne University of Technology, Mail
H39, P.O. Box 218, Hawthorn, Victoria 3198, Australia
ABSTRACT
We describe the development of an FX style correlator for Very Long Base-
line Interferometry (VLBI), implemented in software and intended to run in
multi-processor computing environments, such as large clusters of commodity
machines (Beowulf clusters) or computers specifically designed for high perfor -
mance computing, such a s multi-processor shared-memory machines. We outline
the scientific and practical benefits for VLBI correlation, these chiefly being due
to the inherent flexibility of software and the fact that the highly parallel and
scalable nature of the correlation task is well suited to a multi-processor com-
puting environment. We suggest scientific applications where such an approach
to VLBI correlation is most suited and will give the best r eturns. We report
detailed results from the Distributed FX (DiFX) software correlator, running
on the Swinburne supercomputer (a Beowulf cluster of 300 commodity pro-
cessors), including measures of the performance of the system. For example, to
correlate all Stokes pro ducts for a 10 antenna array, with an aggregate band-
width of 64 MHz per station and using typical time and frequency resolution
presently requires o f order 100 desktop-class compute nodes. Due t o the effect
of Moore’s Law on commodity computing performance, the total number and
cost of compute nodes required to meet a given correlation task continues to de-
crease rapidly with time. We show detailed comparisons between DiFX and two
existing hardware-based correlators: the Australian Long Baseline Array (LBA)
S2 correlator, and the NRAO Very Long Baseline Array (VLBA) correlator. In
both cases, excellent agreement was found between the correlators. Finally, we
describe plans for the future operation of DiFX on the Swinburne supercomputer,
for both astrophysical and geodetic science.
1
co-supervised through the Australia Telescope Natio nal Facility, P.O. Box 7 6, Epping, NSW 1710, Aus-
tralia
2
Current address: University of Massachusetts Amherst, Department of Astronomy, 71 0 North Pleasant
St, Amherst, MA 01 003-9305, USA
2
Subject headings: Techniques: interferometric instrumentation: interferome-
ters pulsars: general radio continuum: general radio lines: general
1. Introduction
The technique of Very Long Baseline Interferometry (VLBI), as a means to study the
very high angular resolution structure of celestial ra dio sources, was developed in the 1960s
(Clark, Cohen & Jauncey 1967; Moran et al. 1967). Some accounts of the early develop-
ments in VLBI, the scientific motivations for the developments, and t echnical overviews are
given in Finley & Go ss (2000).
VLBI, as with all interferometry at radio wavelengths, hinges on the abilty to obtain
a digital representation of the electric field va ria t io ns at a number of spatially separated
locations (radio telescopes), accurately time-tagged a nd tied to a frequency standard. The
digitised data are transported to a single location for processing (a correlator) and are co-
herently combined in order to derive information about the high angular resolution structure
of the ta rget sources of radio emission. The instantaneous angular resolution R of a VLBI
array in arcseconds is given by R = 2.52 × 10
5
λ
D
, where λ is wavelength of the ra diatio n
being observed (typically centimetres) and D is the maximum projected baseline (the dis-
tance b etween radio telescopes in the array projected onto a plane perpendicular to the
source; typically thousands of kilometers). This yields typical angular resolutions of order
milliarcseconds.
Traditionally, the “baseband data (filtered, down-converted, sampled, and quantised
electric field strength measurements: Thompson, Moran & Swenson 1994) generated at each
radio telescope have been recorded to magnetic tape media, for example: the Mark I system
(Bare et al. 1967); the Mark II system (Clark 1973); the Mark III system (Rogers et al.
1983), the Mark IV system (Whitney 1993); and the S2 system (Wietfeldt et al. 1996). After
observation, the tapes from each telescope were shipped to a purpose-built a nd dedicated
digital signal processor, the correlator. A correlator aligns the recorded data streams, corrects
for various geometrical and instrumental effects, and coherent ly combines the data from the
different independent pairs o f radio telescopes. The correlator output streams, known as t he
visibilities, are related to the sky brightness distribution of the radio source essentially via a
Fourier transform relation (Thompson, Moran & Swenson 19 94).
The two fundamental operations required to combine or correlate the recorded signals
are a Fourier transform (F) and a cross-multiplication (X). The order of these operations
can be interchanged to obtain the same result, leading to the so-called XF and FX correlator
3
architectures. A number of well-known descriptions of the theory and practise of radio
int erferometry describe the technique in varying degrees of detail and elaborat e upon the
differences between XF and FX correlators (Thompson, Moran & Swenson 1994; Romney
1999), and the reader is referred to these texts for the details.
Both XF and FX style correlators have traditionally been highly application-sp ecific de-
vices, based on purpose-built integrated circuits. In the last 20 years, Field Programmable
Gate Arrays ( FPGAs) have become popular in correlator designs, with one prominent exam-
ple being the Very Long Ba seline Array (VLBA) correlator (Napier et al. 1994). FPGAs are
reconfigurable or reprogrammable devices that off er more flexibility than application-specific
int egrated circuits (ASICs) while still being highly efficient.
This paper deals with a departure from the traditional approach of tape-based data
recording and correlation on a purpose-built processor (based on either ASICs or FPGAs).
We have developed a correlator that is based on softwa re known as DiFX (Distributed
FX), which runs within a generic multi-processor computing environment. Such a correlator
int erfa ces naturally to modern hard-disk data recording systems, such as the MkV system
(Whitney 2002) and the K5 system (K ondo et al. 2003), that have now largely replaced ta pe-
based recording systems. Specifically, we have developed this software correlator to support
a new disk-based VLBI recording system that has been deployed across the Australian Long
Baseline Array
1
(LBA) for VLBI. We refer the reader to a detailed discussion of the LBA
hard-disk recording system (LBADR ) that appears elsewhere (Phillips et al. 2007, in prepa-
ration). As our software correlator is more broadly applicable than to just the LBA, we will
not dwell on the details of the LBA recording system in this paper, but rather concentrate on
the characteristics, benefits, and performance of our software correlator, giving brief details of
the recording system when required. The correlator source code, binaries and instructions for
use ar e available for download from http://astronomy.swin.edu.au/~adeller/software/difx/.
The very first VLBI observations were in fact correlated using software on a main-
frame computer. Software correlators were developed simultaneously on an IBM 360/50
at t he National Radio Astronomy Observatory (NRAO) (Bare et al. 19 67) and on an IBM
360/92 at the Goddard Space Flight Centre (Moran et al. 1967). As the early experiments
quickly increased in complexity the recorded data volume also increased and it became nec-
essary to design custom hardware for VLBI correlation. Recent examples of such correlators
include: the NRAO Very Long Baseline Array correlator (Napier et al. 1 994); the Joint In-
stitute for VLBI in Europe (JIVE) correlator (Casse 1999); the Canadian NRC S2 correlator
(Carlson et al. 1999); the Japanese VLBI Space Observatory Programme (VSOP) correlator
1
http://www.atnf.csiro .au/vlbi
4
(Horiuchi et al. 2000); and the Australia Telescope National Facility (ATNF) S2 correlator
(Wilson, Roberts & Davis 1996). Table 1 compares some of the basic properties of some
currently-operational hardware VLBI correlators.
Recently, the pace of development of commodity computing equipment (processors, stor-
age, networking etc) has outstripped increases in VLBI computational requirements to the
point that the correlation of VLBI data using relatively inexpensive supercomputer facili-
ties is feasible. The correlation algorithm is “ embarrassingly parallel” and very well suited
to such parallel computing architectures. These facilities are not purpose-built f or corre-
lation but are inherently multi-purpose machines, suited to a wide range of computational
problems.
This approach to correlation gives rise to significant scientific benefits, under certain
circumstances. The benefits stem from the basic char acteristics of correlation, software engi-
neering considerations, and the computing environments. Software is more flexible and easier
to redesign than application-specific hardware or even FPGA-based processors (although the
programming tools for FPGAs are developing rapidly). The highly parallel nature of the
correlation problem, coupled with the availability of high-level programming languages and
optimised vector libraries means that a reasonably general software correlator code can be
written quickly and be used in a va r iety of different computing environments with minimum
modification, or in a dynamic environment where computing resources and/or significant
scientific r equirements can change rapidly with time.
However, the trade-off for flexibility and the convenience of high-level programming tools
is reduced efficiency for any given task, compared to an application-specific or FPGA-based
solution. Put simply, the Non-Recoverable Engineering (NRE) costs for a software correlator
are much lower than for a hardware correlator, but the cost per unit processing power is
higher. Thus, the limited computation needed by a small size correlator means a software
approach will be cheaper overall, while the tremendous computational requirements of corre-
lators on the scale required for t he Expanded Very Large Array (EVLA) or Atacama Large
Millimetre Array (ALMA) dictate that the substantial amounts of NR E spent optimising
hardwar e are worthwhile, at least in 2006.
Software also has an advantage over hardware if the additional support required fo r
unusual or stringent VLBI experiments is impossible or impractical to implement in an
existing hardware correlator. An example of this is given in §4.3. Use of a software correlator
in these cases, even at possibly reduced efficiency, is preferable to the expense of building or
altering dedicated hardware.
A good example of the flexibility of software correlation and its trade-off with efficiency is
5
spectral resolution capability. A generic modern CPU is capable of calculating multi-million
point one-dimensional Fast Fourier Transforms (FFTs), allowing an FX style software corre-
lator utilising this CPU as a processing element to give extremely high frequency resolution:
a million spectral points across the frequency bandwidth of an observation.
Such a correlation would be computationally intensive, a s conventional CPUs are not
optimised for such operations. However, it could be carried out using exactly the same
software and hardware as is used for a generic continuum experiment. Comparison to Table
1 shows that such high spectral resolution is currently impossible on existing hardware
correlators. A number of limitations o n particular hardware correlator implementatio ns,
such as minimum integration times, maximum input data rates, and maximum output data
rates, can be overcome in a similar fashion with software correlators.
The flexibility, inexpensive nature, and ease of production of software correlators makes
them particularly useful for small to medium sized VLBI arrays, since development times
are short, costs are low, and the capabilities are high, providing niche roles for even small
facilities. These factors have led to a resurgence in software correlator applications in a
number of groups around the world. In addition to the efforts described here at the Swin-
burne University of Technology, a gr oup have developed a software correlator, mainly for
geodetic VLBI, at the Communications Research Laboratory (CRL) in Japan (Kondo et al.
2003). This CRL code is also used for real-time fringe checks during observations on the
European VLBI Network (EVN), operated from JIVE
2
. Also at JIVE, a software correlator
has been developed and used to process VLBI observatio ns that tracked the Huygens probe
as it entered the atmosphere of Titan ( Pogrebenko et al. 2003). Spacecraft tracking with
VLBI and software correlation is likely to become a more recognised technique following the
Huygens success, for example for the Chinese Chang’E lunar mission
3
. Finally, the most
ambitious example of a software correlator is the Low Frequency Array (LOFAR) correlator,
which is implemented on an IBM BlueGene/L supercomputer containing 12 ,0 00 processors
4
.
This software correlator rivals the most powerful hardware correlators currently operating
or in the design stage, but differs from the software correlator described in this pap er in that
hardwar e specific o ptimisations and large amounts of NRE were utilised.
The approach we used in the development of the software correlator was la r gely inspired
by the previous success of a group at Swinburne who developed baseband signal processing
software for multi-processor environments, fo r the purposes of pulsar studies (Bailes 2003).
2
Details about the process and results can be found at http://www.evlbi.org/evlbi/tevlb8/ tevlb8.html
3
http://en.cast.cn
4
http://www.lofar.org
6
A prot otype software correlator developed at Swinburne is described in West (2004), with
initial results described in Horiuchi et al. (2006).
In this paper we concentrate on a description of the DiFX software correlator for VLBI
developed at the Swinburne University of Technology, motivated by the factors discussed
above. This correlator has been used as part of the Australia Telescope National Facility
(ATNF) VLBI operations since 2005 and has now replaced the previously used ATNF S2
correlator. The particular architecture we have adopted (§2.1, 2.2 and 2.3), is discussed only
briefly, as the correlation algorithm has been discussed at length in the literature. §3 describes
the DiFX correlator, including the details of the software implementation, verification results
from comparisons with two established hardware correlators, and performance figures-of-
merit. We illustrate some examples of specific scientific applications that can benefit from
software correlation in §4. Finally, our conclusions are presented in §5.
2. The FX software correlator architecture
Many previous works develop in detail the theory of radio interferometry (Thompson, Moran & Swenson
1994; Thompson 1999). The reader is referred to these texts for a complete discussion of
the technique. Here we discuss the main steps used to implement the correlator architecture
(FX) that we have adopted.
A more extensive overview of correlator operations is given in Romney (1999). We
do not describe the operations at the telescopes that convert the incident electric field at
sky frequency to the filtered, down-converted, sampled, and digitised data streams that are
recorded to disk (baseband data in our terminology).
A number of the initial operations are made on the telescope-based data streams. A
number of the later operations are baseline-based. These two sets of operations are briefly
described separately and in sequence.
2.1. Antenna-based operations
2.1.1. Alignment of telescope data streams
To correlate data from a number of different telescopes, the changing delays between
those telescopes must be calculated and used to align the recorded data streams at a prede-
termined point in space (in this case the geocentre) throughout the experiment.
7
The Swinburne software correlator uses CALC 9
5
to generate a geometric delay model
(τ(t)) for each telescope in a given observation, at regular intervals (usually 1 second).
CALC models many geometric effects, including precession, nutation, ocean and atmospheric
loading, and is used by many VLBI correlators including the VLBA and JIVE correlators.
These delays are then interpolated (using a quadratic approximation) to produce accurate
delays (∆τ < 1 × 10
15
sec, compared to an exact CALC value) in double precision for any
time during the course of the observation. The estimated station clock offets and ra tes are
added to the CALC-generated geometric delays.
The baseband data for each telescope are loaded into la rge buffers in memory, and the
int erpolated delay model is used to calculate the accurate delay between each telescope and
the centre of the Earth at any given time during the experiment. This delay, rounded to
the nearest sample, is the integer sample delay. The difference between the delay and the
int eger sample delay is recorded as the antenna ba sed fractional sample delay (up to ± 0.5
sample). Note that the a lignment of any two data streams (as opposed to a data stream
alignment with the geocentre) is good to ± 1 sample.
The integer-sample delay is used t o offset the data pointer in memory and select the
data to be correlated (some number of samples which is a power of 2, starting from the time
of alignment). The fractional sample error is retained to correct the phase as a function
of frequency following alignment to within one sample, fringe rotation, and channelisation
(§2.1.3).
Once the baseband data for each telescope have been selected, they are transferred to
a processing node and unpacked from the coa r sely quant ised representa tion (usually a 2-bit
representation) to a flo ating point (single precision) representation. From this point on, all
operations in the correlator are perfor med using floating point arithmetic, in single precision
unless otherwise specified. Note that the data volume is expanded by a factor of 16 at
this point. The choice of single precision floats (roughly double the precision necessary)
wa s dictated by the capabilities of modern CPUs, which process floats efficiently. Using
sufficient precision also avoids the small decorrelation losses incurred by optimised, low
precision operations often used in hardware correlators. This is a good example of the
sacrifice of efficiency for simplicity and accuracy with a software correlator.
At this point all data streams from all telescopes are aligned to within ± 1 sample
of each other and t he fractional sample errors for each of the telescope data streams are
recorded for later use. A set number of samples from each telescope data stream have been
selected and are awaiting processing on a common processing node (e.g. a PC in a Beowulf
5
http://gemini.gsfc.nasa.gov/solve
8
cluster).
2.1.2. Fringe rotation
Fringe rotation compensates for the changing phase difference introduced by delaying
the signal from each telescope to the geocentre after it has been downconverted to baseband
frequencies. If the changing delay, τ(t), could be compensated for at sky frequency, fringe
rotation would not be required. This, however, is impractical.
The necessary fringe rotation function can be calculated at any po int in time by taking
the sine a nd cosine of the geocentric delay multiplied by the sky frequency ν
0
; it is a pplied
via a complex multiplication f or each telescope’s data stream.
Since the baseband data have already been unpacked to a floating point representation
by this stage, a floating point fringe rotation is applied which yields no fringe rotation losses,
compared, for example, to a 6.25% loss of signal to noise for three level digital fringe rotation
in a two level complex correlator (Roberts 1997).
Implemented as such, fringe rotation represents a mixing operation and will result in a
phase difference term which is quasi-stationary at zero phase (the desired term) and a phase
sum term which has a phase rate of twice the fringe rotation function, 4πν
0
τ(t). The sum
term vector averages to a (normally) negligible contribution to the correlator; for typical
VLBI fringe rates (100s of kHz) and integration times (seconds) the relative magnitude of
the unwanted contribution to each visibility point is < 10
5
. In a software correlator it
wo uld be simple to control the integration time so that the rapidly varying phase term is
int egrated over exactly an integral number of terms of phase, thus making no contribution
to the correlator output. This feature is not currently implemented in DiFX.
We have thus far described fringe rotation as a phase shift for each sample in the time
domain. If perfor med in this manner, we refer to the fringe rotation as “pre–F” ( under an
FX architecture), as it has been applied before the transformation to the frequency domain
in the channelisation process (§2.1.3 ). In this case, the geometric delay for each sample is
int erpolated using the delay model as described in §2.1.1 above.
In cases where the fringe rotation to be applied changes little from the first sample in the
FFT window to the last, a minimal amount of decorrelation is introduced by applying a single
fringe rotation for the entire window. The decorrelation can be estimated by sinc (∆φ/2),
where φ = 2πν
0
τ is the change in baseline phase due t o Earth rotation over the FFT
window.
9
In this way, fringe rotation can be applied a fter channelisation, which saves considerable
computational effort (“post–F ” fringe rotation). For this approach to be viable, the fringe
rates should be low (ie low frequencies and/or short baselines) and the number of channels
should be small (implying that the time range of the samples to be correlated is short
compared to the fringe period). Table 2 shows the degree of decorrelation which would be
incurred by utilising post–F fringe rotation for a range of VLBI observation modes. This
decorrelation is simple to calculate and could be used to correct the visibility amplitudes and
alter visibility weights, although this is not presently implemented in DiFX. It is important
to no te that t he use of post–F fringe rotation is not recommended for all situations shown
in Table 2, and indeed is o nly intended for use when the resultant decorrelation is 1 %.
Post–F fringe rotat io n is desirable in situations where the fringe rate is extremely low,
when the double-frequency term introduced by the mixing operation of pre–F fringe rota tion
is not effectively averaged to zero over the course of an integration and makes a significant
and undesirable contribution to the correlator output. Switching from pre–F to post–F
fringe rotation would be beneficial f or periods of time in most experiments when the source
traverses periods of low phase rate. Sources near a celestial pole can have very low fringe
rates for long periods of time. Alternatively, if very short correlator integration t imes are
used, the sum term may not integrate to zero when using pre–F fringe rotation. Post-F
fringe rotation would therefore be a natural choice in these circumstances.
It should be noted that it is possible to undertake the exact equivalent to pre–F fringe
rotation in the frequency domain. However, this would involve the Fourier transform of
the fringe rotation f unction and a convolution in the frequency domain, which is at least as
computationally intensive as the complex multiplication of the data and fringe rotation in
the time domain.
DiFX implements pre–F or post–F fringe rotation as a user controlled option.
2.1.3. Channelisation and fractional sample error correction
Once the data are aligned and phase corrected after fringe rotation, the time series data
are converted into frequency series data (channelised), prior t o cross multiplication.
Channelisation of the data can be accomplished using an FFT (Fast Fourier Tr ansform)
or a digital filterbank. If used, the filterbank is implemented in a polyphase fa shion, which
essentially inserts a decomposed filter before an FFT (Bellanger & Daguet 2004). This allows
the channel response to be changed from the sinc
2
response natural to a FX correlator to
any desired f unction. In practise, an approximation to a rectangle is applied, although the
10
length of the filter (and hence the accuracy of the approximation) is tunable.
If pre-F fringe rotation has been applied, the data are already in complex form, and
so a complex-to-complex FFT is used. The positive or negative frequencies are selected in
the case of upper or lower sideband data respectively. If post-F fr inge rota tion is to be
applied, the data are still real and so a more efficient real-to-complex FFT may be used.
This is possible due to the conjugate symmetry property of an FFT of a real data series. In
this case, lower sideband data may be recovered by reversing and conjugating the resultant
channels.
The final station-based operation is fractional-sample correction (Romney 1999). This
step is considerably easier in an F X correlator than an XF implementation, since the con-
version to the frequency domain before correlation allows the fractional error to be corrected
exactly, assuming the error to be constant over an FFT length. This is equivalent to the
assumption made for post-F fringe rotation, but is considerably less stringent since the phase
change is proportio nal to the subband bandwidth, rather t han sky frequency as in the case
of fringe ro tation. The frequency domain correction manifests itself as a slope in the phase
as a function of frequency across the observed bandwidth.
Thus, after channelisation, a further complex multiplication is applied t o the channels,
correcting the fractional sample error. In the case of post-F fringe rotation, the fringe
rotation value is added to the fractional-sample correction and the two steps are performed
together.
Either simple FF T or digital polyphase filter bank channelisation can be selected as a
user controlled option in DiFX.
2.2. Baseline-based operations
2.2.1. Cross multiplication of telescope data streams
For each baseline, the channelised data from the telescope pair are cross-multiplied on a
channel by channel basis (after f orming the complex conjugate for the channelised data from
one telescope) to yield the frequency domain complex visibilities that are the fundamental
observables of an interferometer. This is repeated for each common band/polarisation on a
baseline, and fo r a ll baselines. If dual polarisations have been recorded for any given band,
the cross-polarisation terms can also be multiplied, allowing polarisation information for the
target source to be recovered.
11
2.2.2. Integration of correlated output
Once the above cycle of operations has been completed, it is repeated and the resulting
visibilities accumulated (complex added) until a set accumulation time has b een reached.
The number of ”good” cycles per t elescope is recorded, which could form t he basis of a
data weighting scheme, although weights are not currently recorded in DiFX. Generally,
on each cycle the input time increment is equal to the corresponding FFT length (twice
the number of spectral points), but it is also possible to overlap FFTs. This allows more
measurements of higher lags and greater sensitivity to spectral line observations, at the cost
of increased computation. In this way, the limiting time accuracy with which accumulation
can be performed is equal to the FFT length divided by the overlap factor. A caveat to this
statement is discussed in §3.4.
2.2.3. Calibration for nominal telescope T
sys
Cross multiplication, accumulation and normalisation by the antenna autocorrelation
spectra gives the complex cross power spectrum for each baseline, representing the correlated
fraction of the geometric mean of the powers detected at each telescope. To obtain the
correlated power in units o f Jy, the cross power spectra (amplitude components) should be
scaled by the geometric mean of the powers received at each telescope measured in Jy i.e.
the T
sys
in Jy routinely measured at each antenna. Calibration based on the measured T
sys
is
typically p erfo r med as a post-correlation step in AIPS
6
or a similar data analysis package, and
so a nominal value for the T
sys
for each telescope is applied at the correlator. In addition, a
scaling factor to compensate for decorrelation due to the coarse quantisation of the ba seband
data is applied. This corrects the visibility amplitudes, but of course cannot recover the lost
signal to noise. For the 2-bit data typically processed, this scaling factor is 1/0.88 in the
low-correlation limit (Cooper 1970). The relationship becomes non-linear at high correlation
and the scaling factor approaches unity as t he correlation coeffient approaches unity. The
correction for high-correlation cases can be applied in post-processing, generally at the same
time as the application of measured T
sys
values.
6
http://www.aoc.nrao.edu/aips
12
2.2.4. Export of visibility data
Once an accumulation interval has been reached, the visibilities must be stored in a
useful f ormat. Presently, the software correlator supports RPFITS
7
as the output format.
RPFITS files can be loaded into ana lysis packages such as AIPS, CASA
8
, or MIRIAD
9
for data reduction. Ancillary information is included in the RPFITS file along with the
complex visibilities, time stamps, and (u,v,w) coor dinates. The RPFITS standard supports
the appending of a data weight to each spectral point, but DiFX does not currently record
weights. In the future, it is planned t o add additional widely used output formats, such as
FITS-IDI
10
.
2.3. Special processing operations: pulsar binning
Pulsed signals are dispersed as they travel through the interstellar medium (ISM), re-
sulting in a smearing of the pulse arrival time in frequency. In order to correct for the
dispersive effects of the ISM, DiFX employs incoherent dedispersion (Voˆute et al. 2002).
This allows the visibilities generated by the correlator to be divided into pulse phase bins.
Unlike hardware correlators which typically allow only a single on/ o bin, or else employ
2
N
bins of fixed width, DiFX allows an arbitrary number o f bins placed a t arbitary phase
int ervals. The individual bins can be written out separately in the RPFITS file format to
enable investigation of pulse phase dep endent effects, or can be filtered within the correlator
based on a priori pulse profile information.
To calculate which phase bin a visibility at a given frequency and time corresponds
to, the software correlator requires information on the pulsar’s ephemeris, which is supplied
in the form of one o r more “p olyco” files containing a polynomial description of apparent
pulse phase as a function o f time. These are generated using the pulsar analysis program
TEMPO
11
, and require prior timing of a pulsar. Additional software has been written by
the authors to verify the pulsar timing, using the generated polyco files and the baseband
data (in MkV, LBA or K5 format) fr om an experiment, allowing phase bins to be accurately
7
http://www.atnf.csiro .au/computing/software/rpfits.html
8
http://casa.nrao.edu/
9
http://www.atnf.csiro .au/computing/software/miriad
10
http://www.aoc.nrao.edu/aips/FITS-IDI.html
11
http://pulsar .princeton.edu/tempo/reference
manual.html
13
set before correlation.
For VLBI observations of pulsars, it is usually desirable to maximise the signal to noise
of the observations by binning the visibilities based on the pulse phase, and applying a filter
to the binned output based on the signal strength in tha t phase. Typically this filter is
implemented as a binary on/off for each phase bin. Using the pulse profile generated from
the baseband data of an observation, however, DiFX allows a user-specified number of bins to
be g enerated and a filter applied ba sed on pulse strength × bin width, allowing the maximum
theoretical retrieval of signal, as described below. This also reduces the output data volume,
since only an integrated on-pulse” visibility is retained, rather than potentially many phase
bins.
Consider observing a single pulse, divided into M equally spaced phase bins. Let the
pulsar signal strength as a function of phase bin be S(m), and the noise in single phase bin
to be Z ×
M, where Z is the baseline sensitivity f or an integration time of a single pulse
period. When all bins ar e summed (effectively no binning), the S/N ratio will be:
P
M
m=0
S(m)
Z
(1)
as the signal adds coherently while the noise adds in quadrature. For a simple on/off gate
accepting only bins m
1
to m
2
, the S/N ratio will be:
P
m
2
m=m
1
S(m)
r
P
m
2
m=m
1
Z ×
p
(M)
2
(2)
Finally, for the case where each bin is weighted by the pulse signal strength in that bin,
the S/N ratio will be:
P
M
m=0
(S(m))
2
r
P
M
m=0
S(m) × Z ×
p
(M)
2
(3)
For a Gaussian shaped pulse, this allows a modest improvement in recovered signal to
noise of 6% compared to an optimally placed single on/off bin. On a more complicated
profile, such as a Gaussian main pulse with a Gaussian interpulse at half the amplitude, the
improvement in recovered signal to noise increases to 21 %.
14
3. Software cor relation on the Swinburne Beowulf cluster - a case study
3.1. The cluster computing environment
The Swinburne University of Technology supercomputer is a 300 processor Beowulf
cluster, that is a mixture of commodity off-the-shelf desktop and server style PCs, connected
via a gig abit ethernet network. In particular, the supercomputer has five sub-clusters, each
with 48 machines. Four sub-clusters are made up of single processor 3.2 GHz, Pentium 4 PCs
with 1 GB of RAM per machine, while one sub-cluster is made up of dual processor Xeon
servers, each with 2 GB of RAM per machine. The cluster is continuously upgraded and
fully replaced approximately every 3–4 years. The software correlation code must operate in
this multi-user, multi-tasking, and highly dynamic environment.
3.2. Structure of the DiFX code
DiFX is written in C++, but makes heavy use of the optimised vector processing routines
provided by the Intel Perfor mance Primitive (IPP) library
12
. The use of this o ptimised vector
library results in a factor of several performance gain on the Intel CPUs, compared to non-
optimised vector code. Data transfer is handled via the Message Passing Interface (MPI)
standard
13
. The mpich implementation of MPI is used
14
.
Figure 1 shows the high-level class structure of DiFX, along with the data flow. The
correlation is managed by a master node (FxManager), which instructs da ta management
nodes (Datastream) to send time ranges of baseband data to processing nodes (Core). The
data are then processed by the Core nodes, and the results sent back to the FxManager.
Double buffered, non-blocking communication is used to avoid latency delays and maximise
throughtput. Both the Datastream and Core classes can be (and have been) extended to
allow maximum code re-use when handling different data formats and processing algorithms.
The Core nodes make use of an allocatable number of threads to maximise performance on
a heterogenous cluster.
The Datastream nodes can read the baseband data into their memory buffers from a local
disk, a networ k disk or a network socket. Once the data are loaded into the datastream buffer,
the remainder of the system is unaware of its origin. This is one of the most powerful aspects
12
http://www.intel.com/cd/softwa re/products/asmo-na/eng/perflib/ipp/index.htm
13
http://www-unix.mcs.anl.gov/mpi/
14
http://www-unix.mcs.anl.gov/mpi/mpich1/
15
of this correlator architecture, meaning the same correlator can easily be used for production
disk-based VLBI correlation and real-time eVLBI t esting, where the data is transmitted in
real time from the telescopes to the correlator over optical fibre. Real-time eVLBI operational
modes have been tested using DiFX, transmitting data in real-time from the three ATNF
telescopes (Parkes, ATCA, and Mopra) to computing resources at the Swinburne University
of Technology and the University of Western Australia in Perth (a Cray XD-1 utilising
Opteron processors and on-board Xilinx FPGAs). The software correlator then correlates
the transmitted data in real-time. A full account of the new eVLBI capabilities of the
Australian VLBI array will be presented elsewhere (Phillips et al. 2007, in preparation).
3.3. Operating DiFX
DiFX is controlled via an interactive Graphical User Interface (GUI), which calls the
various component programs and helper scripts. The primary purpose of the GUI is to
facilitate easy editing of the text files which configure t he correlator, run external programs
such as the delay model generator, and provide feedback while a job is running. Two files
are necessary to run the actual correlator program. The first is an experiment configuration
file, containing tables of stations, frequency setups, etc, analo gous to a typical hardware
correlator job configuration script. The second file contains the list of compute nodes on
which the correlator program will run.
While it is po ssible to run all tasks required to operate the correlator manually, in prac-
tise they are orga nised via the GUI. This consists of running a series of helper applications
from the GUI to generate the necessary input for the correlator. These include a script to
extract experiment information from the VLBI exchange (VEX) file used to configure and
schedule the telescopes at observe t ime, a delay and (u,v,w) generator which makes use of
CALC 9, and scripts to extract the current load of available nodes. Pulsar–specific info r ma-
tion such as pulse profiles and bin settings can also be loaded. This information is presented
via the GUI and adjustments to the configuration, such as selection computational resources
to be used, can be made before launching a correlation jo b.
In the future it is planned to incorporate some real-time feedback of a mplitude, phase
and lag information from the current correlation via the GUI. This would be similar to the
visibility spectra displays available continuously at connected-element int erferometers.
16
3.4. Performance
In order to keep every compute node used in the correlation fully lo aded, they must be
kept supplied with raw data. If this condition is satisfied, we have a CPU-limited correlation,
and the addition o f further nodes will result in a linear performance gain. In practise,
however, at some point o bta ining data from the data source (network socket or disk) and
transmitting it across the local network to the processing nodes will no longer occur quickly
enough, and the correlation becomes data-limited rather than CPU-limited. Correct selection
of correlation parameters, and good cluster design, will minimise the networking overhead
imposed on a correlation job, and ensure that all compute nodes are fully utilised. This is
discussed in §3.4.1 below, and performance profiles for the CPU-limited case are presented
in §3.4.2.
3.4.1. Networking considerations
As described in §3.2, double-buffered communications to the processing nodes are used
to ensure that nodes ar e never idle as long as sufficient aggregate networ king capability is
available. The use of MPI communications adds a small but unavoidable overhead to data
transfer, meaning the maximum throughput of the system is slightly less than the maximum
network capacity on the most heavily loaded data path.
There ar e two significant data flows: out of each Datastream and into the FxManager.
For any high speed correlation, there will be more Core nodes than Datastream nodes, so
the aggregate rate into a Core will be lower than that out of a Data stream. The flow out of
a Core is a factor of N
cores
times lower than that into the FxManager node.
If processing in real time (when processing time equals o bservation time), the rate
out of each Datastream will be equal to the recording rate, which can be up to 1 Gbps with
modern VLBI arrays and is within the capabilities o f modern commodity ethernet equipment.
The rate into the FxManager node will be equal to the product of the recording rate, the
compression ratio, and the number of Cores, where the compression ratio is the ratio of data
int o a Core to data out of a Core. This is determined by the number of antennas (since
number of baselines scales with number of antennas squared), the number of channels in
the output cross-power spectrum, the number of polarisation products correlated, and the
int egration t ime used before sending data back to t he FxManager node.
It is clearly desirable to maximise the size of data messages sent to a core for processing,
since this minimises the data rate into the FxManager node for a given number of Cores.
However, if the messages are too large, performance will suffer as RAM capacity is exceeded.
17
Network latency may also become problematic, even with buffering. Furthermore, it should
be apparent tha t in this architecture, the Cores act as short-term accumulators (STAs),
with the manager performing the long term accumulation. The length of the STA sets
the minimum integration time. It is important to note, however, that the STA interval is
entirely configurable in the software correlator, to be as short as a single FFT, although
network bandwidth and latency are likely to be limiting factors in this case.
For the majority of experiments it is possible to set a STA length which satisfies all the
network criteria and allows the Cores to be maximally utilised. For combinations of large
numbers of antennas and very high spectral and time resolution, however, it is impossible
to set an STA which allows a satisfactorily low return data rate to the FxManager node. In
this case, real time processing of the experiment is not possible without the installation of
additional network and/or CPU capacity on the FxManager node.
It is important to emphasise that although it is possible to find experimental configura-
tions for which the software correlator suffers a reduction in perfo r mance, these configurations
wo uld be impossible on existing hardware correlators. If communication to the FxManager
node is limiting perfo r mance, it is also possible to parallelize a disk-based experiment by
dividing an experiment into several time ranges and processing these time ranges simultane-
ously, allowing an aggregate processing rate which equals real time. This is actually one o f
the most powerful aspects of the software correlator, and one which would allow scheduling
of correlation to always ensure the cluster was being fully utilised.
3.4.2. CPU-limited performance
Figure 2 shows the results of performance testing on the Swinburne cluster (using the
3.2 GHz Pent ium 4 machines and the giga bit ethernet network) for different array sizes and
spectral resolutions. The results shown in Figure 2 were obtained for data for which the
aggregate bandwidth was 64 MHz, broken up into 8 bands each of 8 MHz bandwidth (4 ×
dual polarisation 8 MHz bands: data were 2-bit sampled: antenna data rate 256 Mbps).
Node requirements for real-time operation are extrapolated from the compute time on an
8 node cluster. The correlation integration time is 1 second and all correlations provide all
four polarisation products. RAM requirements per node ranged from 10 50 MB depending
on spectral resolution, showing that large amounts of RAM are unnecessary for typical
correlations. It can be seen that even a modestly sized commodity cluster can process a
VLBI-sized arr ay in real time at currently ava ila ble data rates.
18
3.5. Correlator comparison results
3.5.1. Comparison with ATNF S2 correlator
Observations to provide data for a correlator comparison between the Swinburne soft-
wa r e correlator and the ATNF S2 correlator were undertaken on March 12, 2006, with the
following subset of the LBA: Parkes (64 m), ATCA (phased array of 5 × 22 m), Mopra (22
m), Hobart (26 m).
Data from these observations were recorded simultaneously to S2 tapes and the L BADR
disks (Phillips et al. 2 007, in preparation) during a 20 minute period, UT 02:30–02:50, cor-
responding to a scan on a bright quasar (PKS 0208512). The data recorded corresponded
to two 16 MHz bands, right circular pola r isation (RCP), in the f requency r anges 2252
2268 MHz and 2268 2284 MHz.
The data recorded on S2 tapes were shipped to the ATNF LBA S2 correlator (Roberts
1997) a t ATNF headquarters and processed. The data recorded to LBADR disks were
shipped t o the Swinburne University of Technology supercomputer and processed using the
software correlator.
At both correlators ident ical T
sys
values in Jy were specified for each antenna and applied
in order to produce nominally calibrated visibility amplitudes. Further, both correlators used
identical clock models, in the form of a single clock offset and linear rate as a function of time
per antenna. Finally, the data were processed at each correlator using 2 second correlator
int egration t imes and 32 spectral channels across each 16 MHz band.
Different implementations of the CALC-based delay generation were used a t each corre-
lator, meaning small differences exist in the delay models used, leading t o differences in the
correlated visibility phase. We have calculated the delay mo del differences and subtracted
the phase due to differential delay model in the following discussion.
From both correlators, RPFITS fo r mat data were output and loaded into the MIRIAD
software (Sault, Teuben, & Wright 1995) for insp ection and analysis. The data from the two
correlators are compared in a series of F ig ures below (Figures 3 5).
Figure 3 shows the visibility amplitudes f or all baselines from both correlators as a
function of time, over the period 02:36:00 - 02:45:00 UT, for one of the 16 MHz bands (2252
2268 MHz). These amplitudes represent the vector averaged data over the frequency
channel range 10 21 (to avoid the edges of the band). The data for each baseline were
fit to a first order polynomial model (S(t) =
dS
dt
t + S
0
, where S is the flux density in Jy, t
is the o ff set in seconds from UT 02 :4 0:30, and S
0
is the extrapolated flux density at time
19
UT 02:40:30, using a standard linear least squares routine. The root mean square (RMS)
variation around the best fit model was calculated for each baseline. The fitted models are
shown in Figure 3 and show no significant differences between the S2 correlator and the
software correlator. Further, the calculated RMS for each baseline agrees very well between
DiFX and the S2 correlator, as summarised in Table 3.
Figure 4 shows the visibility phase as a function of time for each of the six baselines in
the array. Again the data represent the vector averaged correlator output over the frequency
channel range 10 21 within the 2252 2268 MHz band. As discussed above, small
differences between the delay models used at each correlator have been taken into account
as part of this comparison.
Figure 5 shows a comparison of the visibility amplitudes and phases as a function of
frequency in the 2252 2268 MHz band. The data represented here result from a vector
average of the two datasets over a two minute time range, UT 02:40:00 02:42:00. Since
the S2 correlator is an XF style correlator, it cannot exactly correct fractional sample error
in the same manner as an FX correlator such as DiFX, as the channelisation is performed
after accumulatio n. The coarse (post-accumulation) fractional sample correction leads to
decorrelation at all points except the band center, up to a maximum of 10% at the band
edges on long baselines where the geometric delay changes by a sample or more over an
int egration period. We have corrected for this band edge decorrelation in the S2 correlator
amplitudes in Figure 5.
3.5.2. Comparison with the VLBA correlator
Data obtained as part of a regular series of VLBA test observations were used as a
basis for a correlator comparison between the software correlator and the VLBA correlator
(Napier et al. 1994). The observations were made on 2006 August 05 using the Brewster,
Los Alamos, Mauna Kea, Owens Valley, Pie Town, and Saint Croix VLBA stations. One bit
digitised data sampled at the Nyquist rate for four dual polarisation bands, each of 8 MHz
bandwidth, were recorded using the Mk5 system (Whitney 2003). The four bands were at
centre frequencies of 22 79.49, 2287.49, 2295.49, and 2303.49 MHz. The exp eriment code for
the observations was MT628 and the source observed was 092 3+392, a strong and compact
active galactic nucleus. Approximately two minutes of data recorded in this way was used
for the comparison.
The Mk5 data were correlated on the VLBA correlator and exported to FITS f ormat
files. The data were also shipped to the Swinburne supercomputer and correlated using the
20
software correlator, the correlated data exported to RPFITS format files. In both cases, no
scaling of the correlated visibility amplitudes by the system temperatures were made at the
correlators. The visibilities remained in the form of correlation coefficients for the purposes
of the comparison i.e. a system temperature of unity was used to scale the a mplitudes. Each
8 MHz band was correlated with 64 spectral points, and an integration time of 2.048 seconds
wa s used.
The VLBA correlator data were read into AIPS using FITLD with the parameter
DIGICOR=1. The DIGICOR parameter is used to apply certain scalings to the visibil-
ity amplitudes for data from the VLBA correlato r . Further, to obtain the most accurate
scaling of the visibility amplitudes, the task ACCOR was used to correct for imperfect sam-
pler thresholds, deriving corrections to the antenna-based a mplitudes of 0.5%. These
ACCOR corrections were applied to the data and the data were written to disk in FITS
format.
The software correlator data were read directly into AIPS and then written to disk in
the same FITS format as the VLBA correlator data. No corrections to amplitude or phase
of the software correlated data were made in AIPS.
The VLBA correlator data and the software correlator data were b oth imported into
MIRIAD for inspection and analysis, using the same software as used for the comparison
with the LBA correlator described above. RCP from the 2283.49 2291.4 9 MHz band over
the time range UT 17:49:00 17:51:00 was used in all comparison plots below.
Since the delay models used by the VLBA and software correlators differ at the pi-
cosecond level, as is the case for the comparison with the LBA data in §3.5.1, differences in
the visibility phase exist between the correlated datasets. As with the LBA comparison, we
have compensated for the phase error due to the delay models differences in the following
comparison.
Figure 6 shows the visibility amplitudes f or all baselines from both correlators as a
function of time. These amplitudes represent the vector averaged data over the frequency
channel range 10 55 (to avoid the edges of the band). The data for each baseline were fit to
a first order polynomial model (S(t) =
dS
dt
t+S
0
, where S is the correlation coefficient, t is the
offset in seconds from UT 17:50:00, and S
0
is the extrapolated correlation coefficient at time
UT 17:5 0:00) using a standard linear least squares routine. The root mean square (RMS)
variation around the b est fit model was calculated for each baseline. The fitted models
are shown in Figure 6 and show no significant differences between the VLBA correlator
and the software correlator. Further, the calculated RMS for each baseline agrees very well
between the VLBA correlator and the software correlator. The results of the comparison are
21
summarised in Table 4.
Figure 7 shows the visibility phase as a function of time fo r each of the fifteen baselines in
the array. Again the data represent the vector averaged correlator output over the frequency
channel range 1 0 55 within the band. As discussed above, small differences between the
delay models used at each correlator cause phase offsets between the two correlators, and
have been taken into account as part of this comparison.
Figure 8 shows a comparison of the visibility amplitudes and phases as a function of
frequency in the band. The data represented here result from a vector average of the two
datasets over a two minute time ra nge. Figures 6, 7 and 8 show that the results obtained
by the VLBA correlator and DiFX agree to within the RMS errors of the visibilities in each
case, as expected.
4. Scient ific applications of t he Swinburne software correlator
4.1. High frequency resolution spectral line VLBI
As mentioned in the introduction, an attractive feature of software correlation is the ease
with which very high spectral resolution correlation can be undertaken. This is particularly
useful for studies of spectral line sources such as masers when mapping the distribution of the
masing regions and their kinematics i.e. near black holes in galactic nuclei ( Greenhill et al.
1995).
Figure 9 shows a spectrum obtained from an LBA observation of the OH maser G3450.2.
These observations were made with an array consisting of the ATCA (phased array of 5 × 22
m), Parkes (64 m), and Mopra (22 m), recording data from a dual-polarised (RCP and LCP)
4 MHz band onto hard disk. The data were correlated using the software correlato r with
16,384 frequency channels across the 4 MHz band, corresponding to 0.25 kHz per channel
or 0.038 km/s velocity resolution at 1 .7 2 GHz.
These results compare with recent very high spectral resolution work done with the
VLBA. Fish et a l. (2006) o bserved OH masers with the VLBA, using a 62.5 kHz bandwidth
and 512 channels across this band to obtain channel widths of 0.122 kHz or 0.02 km/s velocity
resolution. The velocity resolution of this correlated dataset is almost twice as good as that
shown in Figure 9 . However, the VLBA bandwidth is only 0.016 times the bandwidth of the
observations shown in Figure 9.
If required, DiFX could have correlated these data with 32,768 channels, 65,536 channels
or even higher numbers of channels. As mentioned in the introduction, the only penalty is
22
compute time on a resource with a fixed number of processing elements. DiFX therefore has
a clear advantage over existing hardware correlators in terms of producing very high spectral
resolution over wide bandwidths. This capability is useful if the velocity distribution of an
ensemble of masers in a field is broa d and cannot be contained in a single narrow bandwidth.
4.2. Correlation for wide fields of view
An application t hat takes advantage of the frequency and time resolution of the software
correlator output is wide field imaging. To image a wide field of view, avoiding the effects
of time and bandwidth smearing, high spectral and temporal resolution is required in the
correlator visibility output. For example, at VLBI resolution (40 mas), to image the full
primary beam of an Australia Telescope Compact Array (ATCA) antenna (22 m diameter)
at a frequency of 1.4 GHz, requires a time resolution of the correlator output of 50 ms and a
frequency resolution of 4 kHz (allowing a 0.75 % smearing loss at the F WHM of the primary
beam).
Neither the JIVE nor the VLBA har dware correlators can achieve such high frequency
or t ime resolution for continuum experiments, but DiFX can b e configured for such modes
in an identical manner to a normal continuum experiment.
4.3. Pulsar studies
As compact sources with high velocities, pulsars make excellent testb eds with which
to pro be the structure of the interstellar medium (ISM). Scintillation due to structure in a
scattering screen between the observer and the pulsar causes va r ia t io ns in the interferometric
visibilities, which have some dependence on time and frequency (e.g. Hewish et al. 1985).
Naturally, pulsar binning is advantageous in these studies for maximising signal to noise
ratios.
The most stringent requirement for useful studies of pulsar scintillation, however, is that
of extremely high frequency resolution. Brisken et a l. (2007, in preparation) have recently
demonstrated the capabilities of DiFX for this type of analysis with observations of the
pulsar B083404. The NRAO Green Bank Telescope (100 m), Westerbork (14 × 25 m),
Jodrell Ba nk (76 m), and Arecibo (305 m) were used to provide an ultra-sensitive array at
327 MHz. The data were recorded using the Mk5 system and correlated on the Swinburne
software correlator. The main requirement on the correlation was 0.25 kHz wide frequency
channels, over the broadest bandwidth available, to maximise signal to noise. For these
23
observations a 32 MHz band wa s available. The Swinburne software correlator therefore
correlated the data with 131,072 frequency channels across the band.
No existing hardware correlator can provide such a high frequency resolution over such a
wide bandwidth. Full details of the interpretation o f the B0834 04 software correlated data
will be ava ila ble in Brisken et al. (2007, in preparation). Shown in Figure 10 is a section
of the dynamic spectrum from this observatio n which shows the scintillation structure as
functions of time and frequency.
4.4. Geodetic VLBI
In addition to astronomical VLBI, the software correlator can also be deployed for
geodetic VLBI. Compared to astronomical VLBI, geodetic VLBI has additional requirements,
including different output formats a nd the frequent use of sub-arraying. The flexibility and
capabilities of the software correlator ar e well-matched to this ta sk.
The software correlator has been tested on geodetic datasets obtained using the Mk5
recording system, consisting of 16 frequency bands. These tests form the basis of a geodetic
correlation comparison between the software correlator and the geodetic correlator of the
Max Plank Institut of Radioastronomie in Bonn, Germany. Full results of this correlator
comparison will be reported elsewhere (Tingay et al. 2007, in preparation).
In particular, in Australia a new three-station geodetic VLBI array has been funded
as part of the geospatial component of the Federal Government’s National Collab orative
Research Infrastructure Scheme (NCRIS). This scheme provides for three new g eodetic VLBI
stations of 12 m diameter, Mk5 recording systems, and a modified version of t he software
correlator described in this paper. The modifications necessary to convert DiFX into a
geodetic correlato r consist o f the addition of phase calibration tone extraction, a streamlined
int erfa ce to scan-by-scan correlation for sub-arraying, and a capability to produce visibilities
in a format convenient for geodetic post-processing.
The new Australian geodetic VLBI array will participate in global geodetic observations,
as well as undertaking experiments internal to the Australian tectonic plate.
5. Conclusions
In this paper we have outlined the main benefits of software correlation for small to
medium sized VLBI arrays. They are:
24
The development of software correlation is rapid and does not depend on an intimate
knowledge of digital signal processing har dware, just the algorithms;
The software is flexible and scalable to accommo dat e a very broad ra nge of int erfero-
metric modes of observation, including many which cannot be supported by existing
ASIC-based hardware correlators. Software correlators are therefore ideal for novel
experiments with very special requirements. The main trade-off for improved perfor-
mance with a software correlator is the increase in compute time for a fixed number of
processing elements, or the addition of extra processing elements;
The software can easily incorporate data recorded using mixed disk-based recording
hardwar e;
Medium to large multi-processor computing facilities are available at almost all uni-
versity and government research institutions, allowing users easy entry into VLBI cor-
relation;
The correlation algorithm is highly parallel and very well suited to a parallel multi-
processor computing environment;
The cost of commodity computing continues to fall with time, making large parallel
computing facilities more powerful and less expensive;
Once written, the code can be port ed to a wide r ange of platforms and recompiled
with minimal effort.
We have discussed the implementation of the DiFX software correlator on a standard
Beowulf cluster at the Swinburne University of Technology and have provided performance
figures-of-merit for this implementation, showing that relatively large numbers of telescopes
and relatively high data r ates can be correlated in “ real-time” using numbers of machines
that do no t exceed the capabilities of moderate to large Beowulf clusters. Clear trade-offs are
possible in many areas of performance. For example, if r eal-time operation is not important
it is possible to dramatically reduce the number of processing elements.
We have also showed the results of comprehensive testing of the software correlator,
comparing it output to that of two established hardware correlators, the S2 correlator of
the Australian Long Baseline Array, operated by the ATNF, and the VLBA correlator. The
correlator comparisons of visibility amplitude and phase as functions of time and frequency
verify that DiFX is operating correctly for astronomical VLBI observations.
DiFX now supports all Australian VLBI observations and some global VLBI experi-
ments, at dat a rates up to 1 Gbps per telescope. The DiFX code can be downloaded from
25
http://astronomy.swin.edu.au/~adeller/software/difx/. A number of scientific pro-
grams have already been supported by the software correlator and are briefly discussed here.
Further, a modified version o f the software correlator will be used to support a new VLBI
array in Australia, dedicated to local and global geodetic observations.
This work has been supported by the Australian Federal Government’s Major National
Research Facilities pro gram, the Australian Research Council’s (ARC) Strategic Research
Initiatives (eResearch) scheme, and the ARC’s Discovery Projects scheme. ATD is supported
via a Swinburne University of Technology Chancellor’s Research Scholarship and a CSIRO
postgraduate scholarship. The Long Baseline Array is part of the Australia Telescope which
is funded by the Commonwealth of Australia for operation as a National Facility managed
by CSIRO. The National Radio Astronomy Observatory is a facility of the National Science
Foundation operated under cooperative agr eement by Associated Universities, Inc. We wish
to thank Walter Brisken for kindly making available Figure 10 prior to publication, the
NRAO (Walter Brisken, Craig Walker, Jon Romney) for making available data for the VLBA
correlator comparison and Ga ry Scott for correlating the LBA S2 data for the comparison
with the ATNF S2 correlator.
REFERENCES
Bailes, M. 2003, ASP Conf. Ser. 302: Radio Pulsars, 302, 57
Bare, C. et al. 19 67, Science, 157, 18 9
Bellanger, M. & Daguet, J., IEEE Trans. Commun. Com-22(9), 11991205
Brisken, W. et al. 2007, in preparation
Carlson, B.R. et al. 19 99, PASP, 111, 1025
Casse, J.L. 1999, New Ast. Rev., 43, 503
Clark, B.G. 1973 , Proc. IEEE, 61, 1242
Clark, B.G., Cohen, M.H. & Jauncey, D.L. 1967 , ApJ, 149, L15 1
Cooper, B. F. C. 1970, Australian Journal of Physics, 23, 521
2000, Radio Int erferometry: The Saga and the Science, Proceedings of a Symposium Hon-
oring Barry Clark at 60, ed. D. G. Finley & W. M. Goss, NRAO Workshop Number
27, Associated Universities Inc.
26
Harp, G.R. 2002, in Advanced Telescope and Instrumentatio n Control Software II. eds L.
Hilton. Proceedings of the SPIE, vol 4848, 1
Hewish, A., Wolszczan, A., & Graham, D. A. 1985, MNRAS, 213, 167
Horiuchi, S. et al. 2006, ApJS(submitted)
Horiuchi, S. et al. 2000, Advances in Space Research, 26, 625
Greenhill, L.J. et al. 1995, ApJ, 440, 619
Kondo, T. et al. 2003, in New technologies in VLBI, ASP Conf. Ser., Vol. 306. ed Y.C. Minh,
San Francisco, CA: Astronomical Society of the Pacific
Moran, J.M. et al. 19 67, Science, 157, 67 6
Napier, P.J., Bagri, D.S., Clark, B.G., R ogers, A.E.E., Romney J.D., Thompson, A.R. &
Walker, R.C., Proc. IEEE, 82, 658
Phillips, C. et al. 2007, in preparation
Pogrebenko, S. 2003, in Workshop on Planetary Probe Atmospheric Entry and descent
Traj ectory Analysis and Science, ed A. Wilson, ESA
Roberts, P.P. 1997, Astron. Astrophys. Suppl. Ser., 126 , 379
Rogers, A.E.E. et al. 1983, Science, 219, 51
Romney, J. D. 1999, ASP Conf. Ser. 180: Synthesis Imaging in Radio Astronomy II, 18 0, 57
Sault, R.J., Teuben, P.J. & Wright, M.C.H. 1995, ASPC, 77, 433
Tingay, S.J. et al 20 07, in preparation
Thompson, A.R., Moran, J.M. & Swenson, G.W. 1994, Interferometry and Synthesis in
Radio Astronomy, Kreiger Publishing Company
Thompson, A. R. 199 9, ASP Conf. Ser. 180: Synthesis Imaging in Radio Astronomy II, 180,
11
Voˆute, J.L.L. et al. 2002, A&A, 385, 73 3
West, C. 2004, M.Sc. thesis, Swinburne University o f Technology
Whitney, A.R. 2003, ASPC, 306, 123
27
Whitney, A.R. 2 002, in Proceedings of the 6t h European VLBI Network Symposium, eds.
Ros, E., Porcas, R.W., L obanov, A.P., & Zensus, J.A., 41
Whitney, A.R. 1993, in IAU Symp. 156: Developments in Astrometry and their Impact on
Astrophysics and Geodynamics, 151
Wietfeldt, R. et al. 1996, IEEE Transactions on Instrumentation and Measurement, 45(6),
923
Wilson, W., Roberts, P., Davis, E. 1996, in Proceedings of the 4th APT Workshop, ed E.
King, 16
This preprint was prepared with the AAS L
A
T
E
X macros v5.2.
28
Table 1. Comparison of existing hardware correlator parameters
Correlator Type Maximum telescopes Maximum channels Minimum integration time Maximum input data rate Maximum output data rate Pulsar binning
(in one correlator pass) (per baseline) (ms) (Mbps) (MB/s)
VLBA
A
FX 20 2048 131.072 256 1 yes
JIVE
B
XF 16 2048
C
125
D
1024 6
E
no
ATNF S2
F
XF 6 8192
G
2000 128 0.064 yes
A
http://www.vlba.nrao.edu/astro/obstatus/current/node28.html
B
http://www.jive.nl/correlator/status.html
C
for up to 8 telescopes
D
when using half the correlator
E
data in lag space
F
http://www.atnf.csiro.au/vlbi/correlator/
G
0.5 MHz bandwidth, 2 products
29
Table 2. Maximum decorrelation incurred due to “Post-F” fringe rotation
Observation Max. baseline Frequency # channels/16MHz band Max. decorrelation
(km) (MHz) (%)
LBA low frequency continuum 1400 1600 128 0.003
LBA high frequency continuum 1700 8400 128 0.13
VLBA low frequency continuum 8600 1600 128 0.12
VLBA high frequency continuum 8600 22200 128 21.1
LBA water masers 1700 22200 1024 47.6
30
Table 3. Linear fit parameters for visibility amplitude vs time for DiFX and the LBA S2
correlator, with 95% confidence limits
Baseline Offset
DiFX
(Jy) Offset
LBA
(Jy) Slope
DiFX
(µJy s
1
) Slope
LBA
(µJy s
1
)
PKS - NAR 1.341 ± 0.030 1.343 ± 0.028 10 ± 13 14 ± 12
PKS - MOP 3.185 ± 0.058 3.185 ± 0.063 14 ±24 11 ± 26
PKS - HOB 2.307 ± 0.058 2.293 ± 0.061 12 ± 24 6 ± 24
NAR - MOP 1.616 ± 0.109 1.619 ± 0.114 27 ± 43 10 ± 45
NAR - HOB 1.142 ± 0.111 1.139 ± 0.116 3 ± 44 5 ± 46
MOP - HOB 2.694 ± 0.256 2.681 ± 0.257 18 ± 101 56 ± 101
31
Table 4. Linear fit parameters for visibility amplitude (in units of correlation coefficient)
vs time for DiFX and the VLBA correlator , with 95% confidence limits
Baseline Offset
DiFX
Offset
VLBA
Slope
DiFX
(s
1
× 10
6
) Slope
VLBA
(s
1
× 10
6
)
BR - LA 0.0104 ± 0.0004 0.0103 ± 0.0005 0.8 ± 1.7 0.9 ± 1.7
BR - MK 0.0072 ± 0.0005 0.0071 ± 0.0006 0.1 ± 1.8 0.5 ± 2.0
BR - OV 0.0125 ± 0.0005 0.0124 ± 0.0005 0.7 ± 1.7 0.5 ± 1.8
BR - PT 0.0090 ± 0.0004 0.0089 ± 0.0004 1.0 ± 1.3 1.2 ± 1.5
BR - SC 0.0069 ± 0.0005 0.0069 ± 0.0005 3.1 ± 2.0 2.5 ± 1.8
LA - MK 0.0059 ± 0.0005 0.0059 ± 0.0005 1.9 ± 1.7 1.4 ± 1.7
LA - OV 0.0101 ± 0.0005 0.0100 ± 0.0005 0.4 ± 1.7 0.6 ± 1.7
LA - PT 0.0073 ± 0.0005 0.0 072 ± 0.0005 0.3 ± 1.7 0.5 ± 1.8
LA - SC 0.0058 ± 0.0004 0.0058 ± 0.0004 1.8 ± 1.5 1.9 ± 1.5
MK - OV 0.0078 ±0.0004 0.0077 ± 0.0005 0.9 ± 1.5 0.3 ± 1.8
MK - PT 0.0044 ± 0.0004 0 .0 044 ± 0.0004 0.6 ± 1.7 0.3 ± 1.5
MK - SC 0.0028 ± 0.0005 0.0028 ± 0.0005 0.6 ± 1.8 0.7 ± 1.7
OV - PT 0.008 3 ± 0.0005 0.0082 ± 0 .0 005 1.8 ± 1.8 1.9 ± 1.7
OV - SC 0.0062 ± 0.0005 0.0062 ± 0.0005 0.3 ± 1.8 0.2 ± 1.8
PT - SC 0.0055 ± 0.0005 0.0055 ± 0.0005 1.7 ± 2.0 1.3 ± 1.8
32
Fig. 1.— Overview of the software correlator architecture. Data is loaded into memory from
a disk or network connection by Datastream nodes. These nodes are directed by a Master
node to send data from given time r anges (typically several ms) to the processing elements
(Core nodes). The processed data are sent to the master node for long-term accumulation
and storage on disk.
33
Fig. 2.— Benchmark data showing the computational requirements of DiFX to correlate in
real-time, as described in the text. The nodes are single core 3.2 G Hz Pent ium processors
with 1 GB RAM, and in both benchmarks 64 MHz of total bandwidth per station was
correlated with a 1 second integration period. Top panel shows the scaling of computational
requirements with number of antenna, using 256 spectral po ints per 8 MHz subband. Bottom
panel shows the scaling o f computional r equirements with spectral points per subbband for
a ten station array.
34
Fig. 3.— S2 (red) and DiFX (black) visibility amplitude vs time for the 2252 2268 MHz
band on the source PKS 0208512, as described in the text (PKS = Parkes; MOP = Mopra;
HOB = Hobart; NAR = ATCA). Symbols represent the actual visibilities produced by the
correlators, while the lines represent linear least-squares fits to the visibilities (one line per
dataset).
35
Fig. 4.— S2 (red) and DiFX (black) visibility phase vs time for the 2252 2268 MHz band
on the source PKS 0208512, a s described in the text. Antenna labels as in Figure 2 above.
The PKS-NAR baseline has been shifted by 50 deg for clarity.
36
Fig. 5.— S2 (red) and DiFX (black) visibility amplitude and phase vs frequency data for
the 2252 2268 MHz band on the source PKS 0208512, as described in the text. Antenna
labels as in Figure 2 above. The S2 data has been corrected for fractional-sample error
decorrelation at the band edges as described in the text.
37
Fig. 6.— VLBA correlator (red) and DiFX (black) visibility amplitude vs time for the
2283.49 2291.49 RCP band from the VLBA test observation MT628, as described in the
text. The units of time are seconds from UT 00:00:00, and the amplitude scale is correlation
coefficient. Symbols represent the actual visibilities produced by the correlators, while the
lines represent linear least-squares fits to the visibilities. The text annotation on each panel
lists t he average correlation coefficient amplitude for each correlator over the time period,
as ta bulated in Table 4.
38
Fig. 7.— VLBA correlator (red) and DiFX (black) visibility phase vs time for the 2283.49
2291.49 RCP band from the VLBA test observation MT628, as described in the text. The
units of time are seconds from UT 00:00:00, a nd phase is displayed in degrees.
39
Fig. 8.— VLBA correlator (red) and DiFX (black) visibility amplitude and phase as a
function of frequency fo r the 2283.49 2291.49 RCP band from the VLBA test observation
MT628, as described in the text. The vertical scale for correlation coefficient amplitude on
each panel is 0 0.018, while the phase scale spans ±180 deg. The horizontal scale for each
panel displays channels 0–64.
40
Fig. 9.— A two minute average of the ATCA Parkes cross-power spectrum taken f rom the
software correlated data for t he OH maser G3450.2, as described in the text. The velocity
resolution is 0.038 km/s at the central frequency of 1.72 GHz. The light gray line showing
strong maser emission represents the LCP data and the dark gray line with little emission
represents the RCP data. The maser is highly circularly polarised.
41
Fig. 10.— The cross-power dynamic spectrum showing scintillation variations for the pulsar
B083404 on the Green Bank Telescope Arecibo baseline. Brightness represents the visi-
bility amplitude and colour represents the visibility phase. Increasing frequency runs left to
right and increasing time runs top to bo t t om. This section of the dynamic spectrum repre-
sents just 5% of the time span and 0.5% of the bandwidth of the observation (330 seconds
and 1 60 kHz).
... In order to maximise signal-to-noise on the long baselines for phase calibration, we swapped the phase reference and check sources for the LBA. The data were correlated using the DiFX software correlator (Deller et al. 2007(Deller et al. , 2011, and calibrated according to the standard procedures within the Astronomical Image Processing System (aips, version 31DEC22 ;Wells 1985;Greisen 2003). After the standard external gain calibration, we performed several rounds of hybrid mapping of the phase reference source to derive the time-varying phase, delay, and rate solutions, which we interpolated to J1727. ...
Preprint
Full-text available
Multi-wavelength polarimetry and radio observations of Swift J1727.8-1613 at the beginning of its recent 2023 outburst suggested the presence of a bright compact jet aligned in the north-south direction, which could not be confirmed without high angular resolution images. Using the Very Long Baseline Array and the Long Baseline Array, we imaged Swift J1727.8-1613, during the hard/hard-intermediate state, revealing a bright core and a large, two-sided, asymmetrical, resolved jet. The jet extends in the north-south direction, at a position angle of $-0.60\pm0.07\deg$ East of North. At 8.4 GHz, the entire resolved jet structure is $\sim110 (d/2.7\,\text{kpc})/\sin i$ AU long, with the southern approaching jet extending $\sim80 (d/2.7\,\text{kpc})/\sin i$ AU from the core, where $d$ is the distance to the source and $i$ is the inclination of the jet axis to the line of sight. These images reveal the most resolved continuous X-ray binary jet, and possibly the most physically extended continuous X-ray binary jet ever observed. Based on the brightness ratio of the approaching and receding jets, we put a lower limit on the intrinsic jet speed of $\beta\geq0.27$ and an upper limit on the jet inclination of $i\leq74\deg$. In our first observation we also detected a rapidly fading discrete jet knot $66.89\pm0.04$ mas south of the core, with a proper motion of $0.66\pm0.05$ mas hour$^{-1}$, which we interpret as the result of a downstream internal shock or a jet-ISM interaction, as opposed to a transient relativistic jet launched at the beginning of the outburst.
... The data were correlated in Socorro, New Mexico, with the DiFX 2 software correlator (Deller et al. 2007). The IF bands containing the masers were correlated with 4000 and 2000 spectral channels for the 6.7 GHz CH 3 OH and the 22 GHz H 2 O masers, yielding velocity channels of 0.18 and 0.11 km s −1 , respectively. ...
Preprint
Full-text available
We report measurements of trigonometric parallax and proper motion for two 6.7 GHz methanol and two 22 GHz water masers located in the far portion of the Sagittarius spiral arm as part of the BeSSeL Survey. Distances for these sources are estimated from parallax measurements combined with 3-dimensional kinematic distances. The distances of G033.64$-$00.22, G035.57$-$00.03, G041.15$-$00.20, and G043.89$-$00.78 are $9.9\pm0.5$, $10.2\pm0.6$, $7.6\pm0.5$, and $7.5\pm0.3$ kpc, respectively. Based on these measurements, we suggest that the Sagittarius arm segment beyond about 8 kpc from the Sun in the first Galactic quadrant should be adjusted radially outward relative to previous models. This supports the suggestion of Xu et al. (2023) that the Sagittarius and Perseus spiral arms might merge in the first quadrant before spiraling inward to the far end of the Galactic bar.
... The data were correlated in Socorro, New Mexico, with the DiFX 9 software correlator (Deller et al. 2007). The IF bands containing the masers were correlated with 4000 and 2000 spectral channels for the 6.7 GHz CH 3 OH and the 22 GHz H 2 O masers, yielding velocity channels of 0.18 and 0.11 km s −1 , respectively. ...
Article
Full-text available
We report measurements of trigonometric parallax and proper motion for two 6.7 GHz methanol and two 22 GHz water masers located in the far portion of the Sagittarius spiral arm as part of the BeSSeL Survey. Distances for these sources are estimated from parallax measurements combined with three-dimensional kinematic distances. The distances of G033.64−00.22, G035.57−00.03, G041.15−00.20, and G043.89−00.78 are 9.9 ± 0.5, 10.2 ± 0.6, 7.6 ± 0.5, and 7.5 ± 0.3 kpc, respectively. Based on these measurements, we suggest that the Sagittarius arm segment beyond about 8 kpc from the Sun in the first Galactic quadrant should be adjusted radially outward relative to previous models. This supports the suggestion of Xu et al. that the Sagittarius and Perseus spiral arms might merge in the first quadrant before spiraling inward to the far end of the Galactic bar.
... Pathfinding pulsar astrometry with short-baseline radio interferometry started in the 1970s [74,75]. By the steady progress of the VLBI observation, correlation, and data processing techniques, the astrometry parameters of tens of pulsars have been obtained with the VLBI [76][77][78][79][80]. ...
Article
Full-text available
After two phases of on-site construction and testing (2010–2013 and 2013–2017), the Shanghai Tianma Radio Telescope (TMRT) can work well, with efficiencies better than 50% from 1.3 to 50.0 GHz, mainly benefiting from its low-noise cryogenic receivers and active surface system. Pulsars were chosen as important targets of research at the TMRT because of their important scientific and applied values. To meet the demands of pulsar-related observations, TMRT is equipped with some necessary backends, including a digital backend system (DIBAS) supporting normal pulsar observation modes, a real-time fast-radio-burst-monitoring backend, and baseband backends for very-long-baseline interferometry (VLBI) observations. Utilizing its high sensitivity and simultaneous dual-frequency observation capacity, a sequence of pulsar research endeavors has been undertaken, such as long-term pulsar timing, magnetar monitoring, multi-frequency (or high-frequency) observations, interstellar scintillation, pulsar VLBI, etc. In this paper, we give a short introduction about pulsar observation systems at the TMRT and briefly review the results obtained by these pulsar research projects.
... The observed data were processed using the DiFX software correlator in Daejeon, Korea (Deller et al. 2007;Lee et al. 2015). The DiFX correlator generated a cross-correlation function spectrum with a resolution of 0.125 MHz and an accumulation period of 1 second. ...
Article
Full-text available
We present the results of a spectral analysis using simultaneous multifrequency (22, 43, 86, and 129 GHz) very long baseline interferometry (VLBI) observations of the Korean VLBI Network on BL Lac object, Markarian 421. The data we used were obtained from 2013 January to 2018 June. The light curves showed several flux enhancements with global decreases. To separate the variable and quiescent components in the multifrequency light curves for milliarcsecond-scale emission regions, we assumed that the quiescent radiation comes from the emission regions radiating constant optically thin synchrotron emissions (i.e., a minimum flux density with an optically thin spectral index). The quiescent spectrum determined from the multifrequency light curves was subtracted from the total CLEAN flux density, yielding a variable component in the flux that produces the time-dependent spectrum. We found that the observed spectra were flat at 22–43 GHz, and relatively steep at 43–86 GHz, whereas the quiescent-corrected spectra are sometimes quite different from the observed spectra (e.g., sometimes inverted at 22–43 GHz). The quiescent-corrected spectral indices were much more variable than the observed spectral indices. This spectral investigation implies that the quiescent-spectrum correction can significantly affect the multifrequency spectral index of variable compact radio sources such as blazars. Therefore, the synchrotron self-absorption B-field strength ( B SSA ) can be significantly affected because B SSA is proportional to the fifth power of turnover frequency.
... Except for the Hancock station, all the VLBA stations participated in the observation. The raw VLBA data were correlated using the DiFX software correlator (Deller et al. 2007(Deller et al. , 2011 at Socorro, NM, with an averaging time of 2 s and a frequency resolution of 0.5 MHz. ...
Article
Full-text available
The BL Lacertae object OJ 287 is a very unusual quasar producing a wobbling radio jet and some double-peaked optical outbursts with a possible period of about 12 yr for more than one century. This variability is widely explained by models of binary supermassive black holes (SMBHs) or precessing jets/disks from a single SMBH. To enable an independent and nearly bias-free investigation on these possible scenarios, we explored the feasibility of extremely high-precision differential astrometry on its innermost restless jet at millimeter wavelengths. Through revisiting some existing radio surveys and very long baseline interferometry (VLBI) data at frequencies from 1.4 to 15.4 GHz and performing new Very Long Baseline Array observations at 43.2 GHz, we find that the radio source J0854+1959, 7.′1 apart from OJ 287 and with no clearly seen optical and infrared counterparts, could provide a nearly ideal reference point to track the complicated jet activity of OJ 287. The source J0854+1959 has a stable GHz-peaked radio spectrum and shows a jet structure consisting of two discrete, milliarcsecond-scale-compact and steep-spectrum components and showing no proper motion over about 8 yr. The stable VLBI structure can be interpreted by an episodic, optically thin, and one-sided jet. With respect to its 4.1 mJy peak feature at 43.2 GHz, we have achieved an astrometric precision at the state-of-art level, about 10 μ as. These results indicate that future VLBI astrometry on OJ 287 could allow us to accurately locate its jet apex and activity boundary, align its restless jet structure over decades without significant systematic bias, and probe various astrophysical scenarios.
... Section 3 presents these multi-frequency and multi-epoch VLBI imaging results and the broadband radio spectrum of J0854+1959. We interpret its jet structure and discuss its goodness for future stateof-the-are astrometry on the wobbling jet of OJ 287 and possible direct scientific implication in Section 4. We give our conclusions in Section 5. (Deller et al. 2007(Deller et al. , 2011 at Socorro, NM, with an averaging time of 2 s and a frequency resolution of 0.5 MHz. ...
Preprint
Full-text available
The BL Lacertae object OJ 287 is a very unusual quasar producing a wobbling radio jet and some double-peaked optical outbursts with a possible period of about 12 yr for more than one century. This variability is widely explained by models of binary supermassive black hole (SMBH) or precessing jet/disk from a single SMBH. To enable an independent and nearly bias-free investigation on these possible scenarios, we explored the feasibility of extremely high-precision differential astrometry on its innermost restless jet at mm-wavelengths. Through re-visiting some existing radio surveys and very long baseline interferometry (VLBI) data at frequencies from 1.4 to 15.4 GHz and performing new Very Long Baseline Array (VLBA) observations at 43.2 GHz, we find that the radio source J0854$+$1959, 7.1 arcmin apart from OJ 287 and no clearly-seen optical and infrared counterparts, could provide a nearly ideal reference point to track the complicated jet activity of OJ 287. The source J0854$+$1959 has a stable GHz-peaked radio spectrum and shows a jet structure consisting of two discrete, mas-scale-compact and steep-spectrum components and showing no proper motion over about 8 yr. The stable VLBI structure can be interpreted by an episodic, optically thin and one-sided jet. With respect to its 4.1-mJy peak feature at 43.2 GHz, we have achieved an astrometric precision at the state-of-art level, about 10 $\mu$as. These results indicate that future VLBI astrometry on OJ 287 could allow us to accurately locate its jet apex and activity boundary, align its restless jet structure over decades without significant systematic bias, and probe various astrophysical scenarios.
Article
We present the results of KVN Key Science Program (KSP) for evolved stars, which was launched in 2014. The first phase of KSP ended in June 2020 and the second phase started in October 2020. The goal of KSP is to study the physical characteristics of the evolved stars by observing the spatial distribution and temporal variability of the stellar masers at four frequency-bands (K, Q, W and D bands). The 22 GHz H 2 O maser is usually observed from the outer part of circumstellar envelopes compared to the 43, 86, 129 GHz SiO masers, thus the kinematic links between these regions can be studied by the multi-frequency simultaneous observations of KSP along the stellar pulsation cycles. This eventually enable us to study the enormous mass-loss rate of evolved stars, and the accumulated results from KSP are expected to shed light on the study of the late stage of the stellar evolution.
Article
Full-text available
We describe the design of a radio interferometer composed of a Global Navigation Satellite Systems (GNSS) antenna and a Very Long Baseline Interferometry radio telescope. Our eventual goal is to use this interferometer for geodetic applications including local tie measurements. The GNSS element of the interferometer uses a unique software‐defined receiving system and modified commercial geodetic‐quality GNSS antenna. We ran three observing sessions in 2022 between a 25 m radio telescope in Fort Davis, Texas (FD‐VLBA), a transportable GNSS antenna placed within 100 m, and a GNSS antenna placed at a distance of about 9 km. We have detected a strong interferometric response with a Signal‐to‐Noise Ratio (SNR) of over 1,000 from Global Positioning System and Galileo satellites. We also observed natural radio sources including Galactic supernova remnants and Active Galactic Nuclei located as far as one gigaparsec, thus extending the range of sources that can be referenced to a GNSS antenna by 18 orders of magnitude. These detections represent the first observations made with a GNSS antenna to radio telescope interferometer. We have developed a novel technique based on a Precise Point Positioning solution of the recorded GNSS signal that allows us to extend integration time at 1.5 GHz to at least 20 min without any noticeable SNR degradation when a rubidium frequency standard is used.
Article
To accurately link the radio and optical Celestial Reference Frames (CRFs) at optical bright end, i.e. with GaiaG-band magnitude $\lesssim$13, increasing number and improving sky distribution of radio stars with accurate astrometric parameters from both Very Long Baseline Interferometry (VLBI) and Gaia measurements are mandatory. We selected two radio stars HD 199178 and AR Lacertae as the target for a pilot program for the frame link, using the Very Long Baseline Array at 15 GHz at six epochs spanning about 1 yr, to measure their astrometric parameters. The measured parallax of HD 199178 is 8.949 ± 0.059 mas and the proper motion is μαcos δ = 26.393 ± 0.093 and μδ = −0.950 ± 0.083 mas yr−1, while the parallax of AR Lac is 23.459 ± 0.094 mas and the proper motion is μαcos δ = −51.906 ± 0.138 and μδ = 46.732 ± 0.131 mas yr−1. Our VLBI measured astrometric parameters have accuracies about 4–5 times better than the corresponding historic VLBI measurements and comparable accuracies with those from Gaia, validating the feasibility of frame link using radio stars. With the updated astrometric parameters for these two stars, there is a ∼25 per cent reduction of the uncertainties on the Y-axis for both orientation and spin parameters.
Book
Full-text available
An overview of the basics of radio astronomy is presented as well as a short history of the development of radio interferometry. The underlying relationships of interferometry are discussed with consideration given to the coordinate systems and parameters that are required to describe synthesis mapping and the configurations of antennas for multielement synthesis arrays. Other topics include the response of the receiving system, digital signal processing, VLBI, calibration and Fourier transformation of visibility data, interferometer techniques for astrometry and geodesy, propagation effects, and radio interference.
Article
Pulsed coherent radiation at 102.6 nm was generated via four-wave sum-frequency mixing in atomic hydrogen using electromagnetic-field coupling to produce induced transparency. Radiation at 243 and 657 nm was focused into an effusive beam of atomic hydrogen where resonantly enhanced, nonlinear mixing occurred. The 243 nm (pump) radiation was chosen to be in two-photon resonance with the 2s-1s frequency difference, while the 657 nm (coupling) radiation strongly coupled the 3p and 2s levels resulting in suppression of the linear susceptibility of the system and reduction in absorption of the generated radiation due to induced transparency. Experiments were carried out under low atomic density conditions to study the single-atom response of the generation process. A measure of the induced transparency, given by the linear susceptibility, was obtained from ions generated in the process. Measurements of the VUV power and ion signal were carried out under various conditions of the incident laser fields, and the results were shown to be in general agreement with recent steady-state theories of nonlinear mixing with induced transparency. The application of a dc electric field in excess of {~}4 kV/cm was found to suppress VUV generation, and interfere with induced transparency. Nonlinear generation with low coupling power (
Article
We have made a comparison between data between the Mitaka and Penticton correlators, using VSOP observations of NRAO530 at 1.6 GHz. After calibrating the data with AIPS we find excellent agreement with the closure phase between the data processed at the two correlators. The images made from both data sets also agree well. We have made other VSOP images of OQ208 at 1.6 GHz, and 3C395, J2011-15 at 5 GHz from data correlated by the Mitaka correlator. These images are all consistent with ground-only VLBI results at higher frequencies, and demonstrate the capability of the Mitaka correlator as a VSOP correlator.
Article
In this thesis we provide analytic expressions for the quadratic density response function, establish a series of sum rules and show how the quadratic response provides an improved dynamical local field correction. The closed forms for the quadratic density-density response function c0 (k1, omega1 k 2, omega2) are calculated for a non-interacting Fermi system at zero temperature in one, two, and three dimensions. In the analysis of the result we focus on the two-dimensional case which exhibits a number of interesting features. We find that c0 (k1, omega1 k 2, omega2) vanishes not only in the static long wavelength limit as predicted by the quadratic compressibility sum rule but that there is an adjacent kl- omega l-region where c0 (k1, omega1 k 2, omega2) is purely imaginary. Depending on the angle between k1 and k2 the onset of the real part of c0 can be discontinuous. This and other discontinuities in the real part which all are accompanied by logarithmic singularities in the imaginary part are scrutinized. A systematic series of frequency moments for the quadratic density response and conservation sum rules for the quadratic dynamic structure function is established. Similarly to the linear case the lower order sum rules are exhausted by the RPA structure, but correlational effects govern higher order sum rules. We demonstrate that the correlational correction to the linear response function represents by the dynamical local field correction G( k, omega) is intimately related to the quadratic density response function through the velocity-average approximation. This approach allows a non-perturbative calculation of G(k, omega) which is superior to the widely used STLS.
Article
The Mark III very-long-baseline interferometry (VLBI) system allows recording and later processing of up to 112 megabits per second from each radio telescope of an interferometer array. For astrometric and geodetic measurements, signals from two radio-frequency bands (2.2 to 2.3 and 8.2 to 8.6 gigahertz) are sampled and recorded simultaneously at all antenna sites. From these dual-band recordings the relative group delays of signals arriving at each pair of sites can be corrected for the contributions due to the ionosphere. For many radio sources for which the signals are sufficiently intense, these group delays can be determined with uncertainties under 50 picoseconds. Relative positions of widely separated antennas and celestial coordinates of radio sources have been determined from such measurements with 1 standard deviation uncertainties of about 5 centimeters and 3 milliseconds of arc, respectively. Sample results are given for the lengths of baselines between three antennas in the United States and three in Europe as well as for the arc lengths between the positions of six extragalactic radio sources. There is no significant evidence of change in any of these quantities. For mapping the brightness distribution of such compact radio sources, signals of a given polarization, or of pairs of orthogonal polarizations, can be recorded in up to 28 contiguous bands each nearly 2 megahertz wide. The ability to record large bandwidths and to link together many large radio telescopes allows detection and study of compact sources with flux densities under 1 millijansky.
Article
Modern VLBI observations for both astronomy and geodesy continue to demand the utmost in sensitivity. Of the methods potentially available for increasing the sensitivity of continuum VLBI observations, increasing the recorded bandwidth is generally the most cost effective. Over the past two years a broadly-supported program has been underway at Haystack Observatory to increase the sensitivity of the Mark IIIA VLBI system by more than a factor of 2. The result is an upgrade to the existing Mark IIIA data-acquisition system, dubbed Mark IV, which increases the maximum data rate to 1024 Mbits/sec, more than quadrupling the maximum data-rate of the Mark IIIA.. A new correlator, based on a new custom VLSI correlator chip is also being designed to support the 1 Gbit/sec data rates from the Mark IV data-acquisition-system. An international collaborative effort is being mounted to help defray the high costs of development.
Article
The practical aspects of interferometry are reviewed, starting with a two element interferometer.