ArticlePDF Available

DiFX: A Software Correlator for Very Long Baseline Interferometry Using Multiprocessor Computing Environments

February 2007
Publications of the Astronomical Society of the Pacific 119(853):318-336

February 2007
119(853):318-336

DOI:10.1086/513572

Source
arXiv

Authors:

Steven Tingay

Curtin University

We describe the development of an FX-style correlator for very long baseline interferometry (VLBI), implemented in software and intended to run in multiprocessor computing environments, such as large clusters of commodity machines (Beowulf clusters) or computers specifically designed for high-performance computing, such as multiprocessor shared-memory machines. We outline the scientific and practical benefits for VLBI correlation, these chiefly being due to the inherent flexibility of software and the fact that the highly parallel and scalable nature of the correlation task is well suited to a multiprocessor computing environment. We suggest scientific applications where such an approach to VLBI correlation is most suited and will give the best returns. We report detailed results from the Distributed FX (DiFX) software correlator running on the Swinburne supercomputer (a Beowulf cluster of ~300 commodity processors), including measures of the performance of the system. For example, to correlate all Stokes products for a 10 antenna array with an aggregate bandwidth of 64 MHz per station, and using typical time and frequency resolution, currently requires an order of 100 desktop-class compute nodes. Due to the effect of Moore's law on commodity computing performance, the total number and cost of compute nodes required to meet a given correlation task continues to decrease rapidly with time. We show detailed comparisons between DiFX and two existing hardware-based correlators: the Australian Long Baseline Array S2 correlator and the NRAO Very Long Baseline Array correlator. In both cases, excellent agreement was found between the correlators. Finally, we describe plans for the future operation of DiFX on the Swinburne supercomputer for both astrophysical and geodetic science.

Overview of the software correlator architecture. Data is loaded into memory from a disk or network connection by Datastream nodes. These nodes are directed by a Master node to send data from given time ranges (typically several ms) to the processing elements (Core nodes). The processed data are sent to the master node for long-term accumulation and storage on disk.

…

. Comparison of existing hardware correlator parameters

…

S2 (red) and DiFX (black) visibility amplitude vs time for the 2252-2268 MHz band on the source PKS 0208−512, as described in the text (PKS = Parkes; MOP = Mopra; HOB = Hobart; NAR = ATCA). Symbols represent the actual visibilities produced by the correlators, while the lines represent linear least-squares fits to the visibilities (one line per dataset).

…

S2 (red) and DiFX (black) visibility phase vs time for the 2252-2268 MHz band on the source PKS 0208−512, as described in the text. Antenna labels as in Figure 2 above. The PKS-NAR baseline has been shifted by −50 deg for clarity.

…

A two minute average of the ATCA-Parkes cross-power spectrum taken from the software correlated data for the OH maser G345−0.2, as described in the text. The velocity resolution is 0.038 km/s at the central frequency of 1.72 GHz. The light gray line showing strong maser emission represents the LCP data and the dark gray line with little emission represents the RCP data. The maser is highly circularly polarised.

…

Figures - uploaded by Steven Tingay

Content may be subject to copyright.

Content uploaded by Steven Tingay

Content may be subject to copyright.

arXiv:astro-ph/0702141v1 6 Feb 2007

DiFX: A software correlator for very long baseline interferometry

using multi-processor computing environments

A.T. Deller

, S.J. Tingay, M. Bailes, & C. West

Centre for Astrophysics and Supercomputing, Swinburne University of Technology, Mail

H39, P.O. Box 218, Hawthorn, Victoria 3198, Australia

ABSTRACT

We describe the development of an FX style correlator for Very Long Base-

line Interferometry (VLBI), implemented in software and intended to run in

multi-processor computing environments, such as large clusters of commodity

machines (Beowulf clusters) or computers speciﬁcally designed for high perfor -

mance computing, such a s multi-processor shared-memory machines. We outline

the scientiﬁc and practical beneﬁts for VLBI correlation, these chieﬂy being due

to the inherent ﬂexibility of software and the fact that the highly parallel and

scalable nature of the correlation task is well suited to a multi-processor com-

puting environment. We suggest scientiﬁc applications where such an approach

to VLBI correlation is most suited and will give the best r eturns. We report

detailed results from the Distributed FX (DiFX) software correlator, running

on the Swinburne supercomputer (a Beowulf cluster of ∼300 commodity pro-

cessors), including measures of the performance of the system. For example, to

correlate all Stokes pro ducts for a 10 antenna array, with an aggregate band-

width of 64 MHz per station and using typical time and frequency resolution

presently requires o f order 100 desktop-class compute nodes. Due t o the eﬀect

of Moore’s Law on commodity computing performance, the total number and

cost of compute nodes required to meet a given correlation task continues to de-

crease rapidly with time. We show detailed comparisons between DiFX and two

existing hardware-based correlators: the Australian Long Baseline Array (LBA)

S2 correlator, and the NRAO Very Long Baseline Array (VLBA) correlator. In

both cases, excellent agreement was found between the correlators. Finally, we

describe plans for the future operation of DiFX on the Swinburne supercomputer,

for both astrophysical and geodetic science.

co-supervised through the Australia Telescope Natio nal Facility, P.O. Box 7 6, Epping, NSW 1710, Aus-

tralia

Current address: University of Massachusetts Amherst, Department of Astronomy, 71 0 North Pleasant

St, Amherst, MA 01 003-9305, USA

– 2 –

Subject headings: Techniques: interferometric — instrumentation: interferome-

ters — pulsars: general — radio continuum: general — radio lines: general

1. Introduction

The technique of Very Long Baseline Interferometry (VLBI), as a means to study the

very high angular resolution structure of celestial ra dio sources, was developed in the 1960s

(Clark, Cohen & Jauncey 1967; Moran et al. 1967). Some accounts of the early develop-

ments in VLBI, the scientiﬁc motivations for the developments, and t echnical overviews are

given in Finley & Go ss (2000).

VLBI, as with all interferometry at radio wavelengths, hinges on the abilty to obtain

a digital representation of the electric ﬁeld va ria t io ns at a number of spatially separated

locations (radio telescopes), accurately time-tagged a nd tied to a frequency standard. The

digitised data are transported to a single location for processing (a correlator) and are co-

herently combined in order to derive information about the high angular resolution structure

of the ta rget sources of radio emission. The instantaneous angular resolution R of a VLBI

array in arcseconds is given by R = 2.52 × 10

, where λ is wavelength of the ra diatio n

being observed (typically centimetres) and D is the maximum projected baseline (the dis-

tance b etween radio telescopes in the array projected onto a plane perpendicular to the

source; typically thousands of kilometers). This yields typical angular resolutions of order

milliarcseconds.

Traditionally, the “baseband” data (ﬁltered, down-converted, sampled, and quantised

electric ﬁeld strength measurements: Thompson, Moran & Swenson 1994) generated at each

radio telescope have been recorded to magnetic tape media, for example: the Mark I system

(Bare et al. 1967); the Mark II system (Clark 1973); the Mark III system (Rogers et al.

1983), the Mark IV system (Whitney 1993); and the S2 system (Wietfeldt et al. 1996). After

observation, the tapes from each telescope were shipped to a purpose-built a nd dedicated

digital signal processor, the correlator. A correlator aligns the recorded data streams, corrects

for various geometrical and instrumental eﬀects, and coherent ly combines the data from the

diﬀerent independent pairs o f radio telescopes. The correlator output streams, known as t he

visibilities, are related to the sky brightness distribution of the radio source essentially via a

Fourier transform relation (Thompson, Moran & Swenson 19 94).

The two fundamental operations required to combine or correlate the recorded signals

are a Fourier transform (F) and a cross-multiplication (X). The order of these operations

can be interchanged to obtain the same result, leading to the so-called XF and FX correlator

– 3 –

architectures. A number of well-known descriptions of the theory and practise of radio

int erferometry describe the technique in varying degrees of detail and elaborat e upon the

diﬀerences between XF and FX correlators (Thompson, Moran & Swenson 1994; Romney

1999), and the reader is referred to these texts for the details.

Both XF and FX style correlators have traditionally been highly application-sp eciﬁc de-

vices, based on purpose-built integrated circuits. In the last 20 years, Field Programmable

Gate Arrays ( FPGAs) have become popular in correlator designs, with one prominent exam-

ple being the Very Long Ba seline Array (VLBA) correlator (Napier et al. 1994). FPGAs are

reconﬁgurable or reprogrammable devices that oﬀ er more ﬂexibility than application-speciﬁc

int egrated circuits (ASICs) while still being highly eﬃcient.

This paper deals with a departure from the traditional approach of tape-based data

recording and correlation on a purpose-built processor (based on either ASICs or FPGAs).

We have developed a correlator that is based on softwa re known as DiFX (Distributed

FX), which runs within a generic multi-processor computing environment. Such a correlator

int erfa ces naturally to modern hard-disk data recording systems, such as the MkV system

(Whitney 2002) and the K5 system (K ondo et al. 2003), that have now largely replaced ta pe-

based recording systems. Speciﬁcally, we have developed this software correlator to support

a new disk-based VLBI recording system that has been deployed across the Australian Long

Baseline Array

(LBA) for VLBI. We refer the reader to a detailed discussion of the LBA

hard-disk recording system (LBADR ) that appears elsewhere (Phillips et al. 2007, in prepa-

ration). As our software correlator is more broadly applicable than to just the LBA, we will

not dwell on the details of the LBA recording system in this paper, but rather concentrate on

the characteristics, beneﬁts, and performance of our software correlator, giving brief details of

the recording system when required. The correlator source code, binaries and instructions for

use ar e available for download from http://astronomy.swin.edu.au/~adeller/software/difx/.

The very ﬁrst VLBI observations were in fact correlated using software on a main-

frame computer. Software correlators were developed simultaneously on an IBM 360/50

at t he National Radio Astronomy Observatory (NRAO) (Bare et al. 19 67) and on an IBM

360/92 at the Goddard Space Flight Centre (Moran et al. 1967). As the early experiments

quickly increased in complexity the recorded data volume also increased and it became nec-

essary to design custom hardware for VLBI correlation. Recent examples of such correlators

include: the NRAO Very Long Baseline Array correlator (Napier et al. 1 994); the Joint In-

stitute for VLBI in Europe (JIVE) correlator (Casse 1999); the Canadian NRC S2 correlator

(Carlson et al. 1999); the Japanese VLBI Space Observatory Programme (VSOP) correlator

http://www.atnf.csiro .au/vlbi

– 4 –

(Horiuchi et al. 2000); and the Australia Telescope National Facility (ATNF) S2 correlator

(Wilson, Roberts & Davis 1996). Table 1 compares some of the basic properties of some

currently-operational hardware VLBI correlators.

Recently, the pace of development of commodity computing equipment (processors, stor-

age, networking etc) has outstripped increases in VLBI computational requirements to the

point that the correlation of VLBI data using relatively inexpensive supercomputer facili-

ties is feasible. The correlation algorithm is “ embarrassingly parallel” and very well suited

to such parallel computing architectures. These facilities are not purpose-built f or corre-

lation but are inherently multi-purpose machines, suited to a wide range of computational

problems.

This approach to correlation gives rise to signiﬁcant scientiﬁc beneﬁts, under certain

circumstances. The beneﬁts stem from the basic char acteristics of correlation, software engi-

neering considerations, and the computing environments. Software is more ﬂexible and easier

to redesign than application-speciﬁc hardware or even FPGA-based processors (although the

programming tools for FPGAs are developing rapidly). The highly parallel nature of the

correlation problem, coupled with the availability of high-level programming languages and

optimised vector libraries means that a reasonably general software correlator code can be

written quickly and be used in a va r iety of diﬀerent computing environments with minimum

modiﬁcation, or in a dynamic environment where computing resources and/or signiﬁcant

scientiﬁc r equirements can change rapidly with time.

However, the trade-oﬀ for ﬂexibility and the convenience of high-level programming tools

is reduced eﬃciency for any given task, compared to an application-speciﬁc or FPGA-based

solution. Put simply, the Non-Recoverable Engineering (NRE) costs for a software correlator

are much lower than for a hardware correlator, but the cost per unit processing power is

higher. Thus, the limited computation needed by a small size correlator means a software

approach will be cheaper overall, while the tremendous computational requirements of corre-

lators on the scale required for t he Expanded Very Large Array (EVLA) or Atacama Large

Millimetre Array (ALMA) dictate that the substantial amounts of NR E spent optimising

hardwar e are worthwhile, at least in 2006.

Software also has an advantage over hardware if the additional support required fo r

unusual or stringent VLBI experiments is impossible or impractical to implement in an

existing hardware correlator. An example of this is given in §4.3. Use of a software correlator

in these cases, even at possibly reduced eﬃciency, is preferable to the expense of building or

altering dedicated hardware.

A good example of the ﬂexibility of software correlation and its trade-oﬀ with eﬃciency is

– 5 –

spectral resolution capability. A generic modern CPU is capable of calculating multi-million

point one-dimensional Fast Fourier Transforms (FFTs), allowing an FX style software corre-

lator utilising this CPU as a processing element to give extremely high frequency resolution:

a million spectral points across the frequency bandwidth of an observation.

Such a correlation would be computationally intensive, a s conventional CPUs are not

optimised for such operations. However, it could be carried out using exactly the same

software and hardware as is used for a generic continuum experiment. Comparison to Table

1 shows that such high spectral resolution is currently impossible on existing hardware

correlators. A number of limitations o n particular hardware correlator implementatio ns,

such as minimum integration times, maximum input data rates, and maximum output data

rates, can be overcome in a similar fashion with software correlators.

The ﬂexibility, inexpensive nature, and ease of production of software correlators makes

them particularly useful for small to medium sized VLBI arrays, since development times

are short, costs are low, and the capabilities are high, providing niche roles for even small

facilities. These factors have led to a resurgence in software correlator applications in a

number of groups around the world. In addition to the eﬀorts described here at the Swin-

burne University of Technology, a gr oup have developed a software correlator, mainly for

geodetic VLBI, at the Communications Research Laboratory (CRL) in Japan (Kondo et al.

2003). This CRL code is also used for real-time fringe checks during observations on the

European VLBI Network (EVN), operated from JIVE

. Also at JIVE, a software correlator

has been developed and used to process VLBI observatio ns that tracked the Huygens probe

as it entered the atmosphere of Titan ( Pogrebenko et al. 2003). Spacecraft tracking with

VLBI and software correlation is likely to become a more recognised technique following the

Huygens success, for example for the Chinese Chang’E lunar mission

. Finally, the most

ambitious example of a software correlator is the Low Frequency Array (LOFAR) correlator,

which is implemented on an IBM BlueGene/L supercomputer containing 12 ,0 00 processors

This software correlator rivals the most powerful hardware correlators currently operating

or in the design stage, but diﬀers from the software correlator described in this pap er in that

hardwar e speciﬁc o ptimisations and large amounts of NRE were utilised.

The approach we used in the development of the software correlator was la r gely inspired

by the previous success of a group at Swinburne who developed baseband signal processing

software for multi-processor environments, fo r the purposes of pulsar studies (Bailes 2003).

Details about the process and results can be found at http://www.evlbi.org/evlbi/tevlb8/ tevlb8.html

http://en.cast.cn

http://www.lofar.org

– 6 –

A prot otype software correlator developed at Swinburne is described in West (2004), with

initial results described in Horiuchi et al. (2006).

In this paper we concentrate on a description of the DiFX software correlator for VLBI

developed at the Swinburne University of Technology, motivated by the factors discussed

above. This correlator has been used as part of the Australia Telescope National Facility

(ATNF) VLBI operations since 2005 and has now replaced the previously used ATNF S2

correlator. The particular architecture we have adopted (§2.1, 2.2 and 2.3), is discussed only

brieﬂy, as the correlation algorithm has been discussed at length in the literature. §3 describes

the DiFX correlator, including the details of the software implementation, veriﬁcation results

from comparisons with two established hardware correlators, and performance ﬁgures-of-

merit. We illustrate some examples of speciﬁc scientiﬁc applications that can beneﬁt from

software correlation in §4. Finally, our conclusions are presented in §5.

2. The FX software correlator architecture

Many previous works develop in detail the theory of radio interferometry (Thompson, Moran & Swenson

1994; Thompson 1999). The reader is referred to these texts for a complete discussion of

the technique. Here we discuss the main steps used to implement the correlator architecture

(FX) that we have adopted.

A more extensive overview of correlator operations is given in Romney (1999). We

do not describe the operations at the telescopes that convert the incident electric ﬁeld at

sky frequency to the ﬁltered, down-converted, sampled, and digitised data streams that are

recorded to disk (baseband data in our terminology).

A number of the initial operations are made on the telescope-based data streams. A

number of the later operations are baseline-based. These two sets of operations are brieﬂy

described separately and in sequence.

2.1. Antenna-based operations

2.1.1. Alignment of telescope data streams

To correlate data from a number of diﬀerent telescopes, the changing delays between

those telescopes must be calculated and used to align the recorded data streams at a prede-

termined point in space (in this case the geocentre) throughout the experiment.

– 7 –

The Swinburne software correlator uses CALC 9

to generate a geometric delay model

(τ(t)) for each telescope in a given observation, at regular intervals (usually 1 second).

CALC models many geometric eﬀects, including precession, nutation, ocean and atmospheric

loading, and is used by many VLBI correlators including the VLBA and JIVE correlators.

These delays are then interpolated (using a quadratic approximation) to produce accurate

delays (∆τ < 1 × 10

−15

sec, compared to an exact CALC value) in double precision for any

time during the course of the observation. The estimated station clock oﬀets and ra tes are

added to the CALC-generated geometric delays.

The baseband data for each telescope are loaded into la rge buﬀers in memory, and the

int erpolated delay model is used to calculate the accurate delay between each telescope and

the centre of the Earth at any given time during the experiment. This delay, rounded to

the nearest sample, is the integer sample delay. The diﬀerence between the delay and the

int eger sample delay is recorded as the antenna ba sed fractional sample delay (up to ± 0.5

sample). Note that the a lignment of any two data streams (as opposed to a data stream

alignment with the geocentre) is good to ± 1 sample.

The integer-sample delay is used t o oﬀset the data pointer in memory and select the

data to be correlated (some number of samples which is a power of 2, starting from the time

of alignment). The fractional sample error is retained to correct the phase as a function

of frequency following alignment to within one sample, fringe rotation, and channelisation

(§2.1.3).

Once the baseband data for each telescope have been selected, they are transferred to

a processing node and unpacked from the coa r sely quant ised representa tion (usually a 2-bit

representation) to a ﬂo ating point (single precision) representation. From this point on, all

operations in the correlator are perfor med using ﬂoating point arithmetic, in single precision

unless otherwise speciﬁed. Note that the data volume is expanded by a factor of 16 at

this point. The choice of single precision ﬂoats (roughly double the precision necessary)

wa s dictated by the capabilities of modern CPUs, which process ﬂoats eﬃciently. Using

suﬃcient precision also avoids the small decorrelation losses incurred by optimised, low

precision operations often used in hardware correlators. This is a good example of the

sacriﬁce of eﬃciency for simplicity and accuracy with a software correlator.

At this point all data streams from all telescopes are aligned to within ± 1 sample

of each other and t he fractional sample errors for each of the telescope data streams are

recorded for later use. A set number of samples from each telescope data stream have been

selected and are awaiting processing on a common processing node (e.g. a PC in a Beowulf

http://gemini.gsfc.nasa.gov/solve

– 8 –

cluster).

2.1.2. Fringe rotation

Fringe rotation compensates for the changing phase diﬀerence introduced by delaying

the signal from each telescope to the geocentre after it has been downconverted to baseband

frequencies. If the changing delay, τ(t), could be compensated for at sky frequency, fringe

rotation would not be required. This, however, is impractical.

The necessary fringe rotation function can be calculated at any po int in time by taking

the sine a nd cosine of the geocentric delay multiplied by the sky frequency ν

; it is a pplied

via a complex multiplication f or each telescope’s data stream.

Since the baseband data have already been unpacked to a ﬂoating point representation

by this stage, a ﬂoating point fringe rotation is applied which yields no fringe rotation losses,

compared, for example, to a 6.25% loss of signal to noise for three level digital fringe rotation

in a two level complex correlator (Roberts 1997).

Implemented as such, fringe rotation represents a mixing operation and will result in a

phase diﬀerence term which is quasi-stationary at zero phase (the desired term) and a phase

sum term which has a phase rate of twice the fringe rotation function, ∼ 4πν

τ(t). The sum

term vector averages to a (normally) negligible contribution to the correlator; for typical

VLBI fringe rates (100s of kHz) and integration times (seconds) the relative magnitude of

the unwanted contribution to each visibility point is < 10

−5

. In a software correlator it

wo uld be simple to control the integration time so that the rapidly varying phase term is

int egrated over exactly an integral number of terms of phase, thus making no contribution

to the correlator output. This feature is not currently implemented in DiFX.

We have thus far described fringe rotation as a phase shift for each sample in the time

domain. If perfor med in this manner, we refer to the fringe rotation as “pre–F” ( under an

FX architecture), as it has been applied before the transformation to the frequency domain

in the channelisation process (§2.1.3 ). In this case, the geometric delay for each sample is

int erpolated using the delay model as described in §2.1.1 above.

In cases where the fringe rotation to be applied changes little from the ﬁrst sample in the

FFT window to the last, a minimal amount of decorrelation is introduced by applying a single

fringe rotation for the entire window. The decorrelation can be estimated by sinc (∆φ/2),

where ∆φ = 2πν

∆τ is the change in baseline phase due t o Earth rotation over the FFT

window.

– 9 –

In this way, fringe rotation can be applied a fter channelisation, which saves considerable

computational eﬀort (“post–F ” fringe rotation). For this approach to be viable, the fringe

rates should be low (ie low frequencies and/or short baselines) and the number of channels

should be small (implying that the time range of the samples to be correlated is short

compared to the fringe period). Table 2 shows the degree of decorrelation which would be

incurred by utilising post–F fringe rotation for a range of VLBI observation modes. This

decorrelation is simple to calculate and could be used to correct the visibility amplitudes and

alter visibility weights, although this is not presently implemented in DiFX. It is important

to no te that t he use of post–F fringe rotation is not recommended for all situations shown

in Table 2, and indeed is o nly intended for use when the resultant decorrelation is ≪ 1 %.

Post–F fringe rotat io n is desirable in situations where the fringe rate is extremely low,

when the double-frequency term introduced by the mixing operation of pre–F fringe rota tion

is not eﬀectively averaged to zero over the course of an integration and makes a signiﬁcant

and undesirable contribution to the correlator output. Switching from pre–F to post–F

fringe rotation would be beneﬁcial f or periods of time in most experiments when the source

traverses periods of low phase rate. Sources near a celestial pole can have very low fringe

rates for long periods of time. Alternatively, if very short correlator integration t imes are

used, the sum term may not integrate to zero when using pre–F fringe rotation. Post-F

fringe rotation would therefore be a natural choice in these circumstances.

It should be noted that it is possible to undertake the exact equivalent to pre–F fringe

rotation in the frequency domain. However, this would involve the Fourier transform of

the fringe rotation f unction and a convolution in the frequency domain, which is at least as

computationally intensive as the complex multiplication of the data and fringe rotation in

the time domain.

DiFX implements pre–F or post–F fringe rotation as a user controlled option.

2.1.3. Channelisation and fractional sample error correction

Once the data are aligned and phase corrected after fringe rotation, the time series data

are converted into frequency series data (channelised), prior t o cross multiplication.

Channelisation of the data can be accomplished using an FFT (Fast Fourier Tr ansform)

or a digital ﬁlterbank. If used, the ﬁlterbank is implemented in a polyphase fa shion, which

essentially inserts a decomposed ﬁlter before an FFT (Bellanger & Daguet 2004). This allows

the channel response to be changed from the sinc

response natural to a FX correlator to

any desired f unction. In practise, an approximation to a rectangle is applied, although the

– 10 –

length of the ﬁlter (and hence the accuracy of the approximation) is tunable.

If pre-F fringe rotation has been applied, the data are already in complex form, and

so a complex-to-complex FFT is used. The positive or negative frequencies are selected in

the case of upper or lower sideband data respectively. If post-F fr inge rota tion is to be

applied, the data are still real and so a more eﬃcient real-to-complex FFT may be used.

This is possible due to the conjugate symmetry property of an FFT of a real data series. In

this case, lower sideband data may be recovered by reversing and conjugating the resultant

channels.

The ﬁnal station-based operation is fractional-sample correction (Romney 1999). This

step is considerably easier in an F X correlator than an XF implementation, since the con-

version to the frequency domain before correlation allows the fractional error to be corrected

exactly, assuming the error to be constant over an FFT length. This is equivalent to the

assumption made for post-F fringe rotation, but is considerably less stringent since the phase

change is proportio nal to the subband bandwidth, rather t han sky frequency as in the case

of fringe ro tation. The frequency domain correction manifests itself as a slope in the phase

as a function of frequency across the observed bandwidth.

Thus, after channelisation, a further complex multiplication is applied t o the channels,

correcting the fractional sample error. In the case of post-F fringe rotation, the fringe

rotation value is added to the fractional-sample correction and the two steps are performed

together.

Either simple FF T or digital polyphase ﬁlter bank channelisation can be selected as a

user controlled option in DiFX.

2.2. Baseline-based operations

2.2.1. Cross multiplication of telescope data streams

For each baseline, the channelised data from the telescope pair are cross-multiplied on a

channel by channel basis (after f orming the complex conjugate for the channelised data from

one telescope) to yield the frequency domain complex visibilities that are the fundamental

observables of an interferometer. This is repeated for each common band/polarisation on a

baseline, and fo r a ll baselines. If dual polarisations have been recorded for any given band,

the cross-polarisation terms can also be multiplied, allowing polarisation information for the

target source to be recovered.

– 11 –

2.2.2. Integration of correlated output

Once the above cycle of operations has been completed, it is repeated and the resulting

visibilities accumulated (complex added) until a set accumulation time has b een reached.

The number of ”good” cycles per t elescope is recorded, which could form t he basis of a

data weighting scheme, although weights are not currently recorded in DiFX. Generally,

on each cycle the input time increment is equal to the corresponding FFT length (twice

the number of spectral points), but it is also possible to overlap FFTs. This allows more

measurements of higher lags and greater sensitivity to spectral line observations, at the cost

of increased computation. In this way, the limiting time accuracy with which accumulation

can be performed is equal to the FFT length divided by the overlap factor. A caveat to this

statement is discussed in §3.4.

2.2.3. Calibration for nominal telescope T

sys

Cross multiplication, accumulation and normalisation by the antenna autocorrelation

spectra gives the complex cross power spectrum for each baseline, representing the correlated

fraction of the geometric mean of the powers detected at each telescope. To obtain the

correlated power in units o f Jy, the cross power spectra (amplitude components) should be

scaled by the geometric mean of the powers received at each telescope measured in Jy i.e.

the T

sys

in Jy routinely measured at each antenna. Calibration based on the measured T

sys

typically p erfo r med as a post-correlation step in AIPS

or a similar data analysis package, and

so a nominal value for the T

sys

for each telescope is applied at the correlator. In addition, a

scaling factor to compensate for decorrelation due to the coarse quantisation of the ba seband

data is applied. This corrects the visibility amplitudes, but of course cannot recover the lost

signal to noise. For the 2-bit data typically processed, this scaling factor is 1/0.88 in the

low-correlation limit (Cooper 1970). The relationship becomes non-linear at high correlation

and the scaling factor approaches unity as t he correlation coeﬃent approaches unity. The

correction for high-correlation cases can be applied in post-processing, generally at the same

time as the application of measured T

sys

values.

http://www.aoc.nrao.edu/aips

– 12 –

2.2.4. Export of visibility data

Once an accumulation interval has been reached, the visibilities must be stored in a

useful f ormat. Presently, the software correlator supports RPFITS

as the output format.

RPFITS ﬁles can be loaded into ana lysis packages such as AIPS, CASA

, or MIRIAD

for data reduction. Ancillary information is included in the RPFITS ﬁle along with the

complex visibilities, time stamps, and (u,v,w) coor dinates. The RPFITS standard supports

the appending of a data weight to each spectral point, but DiFX does not currently record

weights. In the future, it is planned t o add additional widely used output formats, such as

FITS-IDI

2.3. Special processing operations: pulsar binning

Pulsed signals are dispersed as they travel through the interstellar medium (ISM), re-

sulting in a smearing of the pulse arrival time in frequency. In order to correct for the

dispersive eﬀects of the ISM, DiFX employs incoherent dedispersion (Voˆute et al. 2002).

This allows the visibilities generated by the correlator to be divided into pulse phase bins.

Unlike hardware correlators which typically allow only a single on/ oﬀ bin, or else employ

bins of ﬁxed width, DiFX allows an arbitrary number o f bins placed a t arbitary phase

int ervals. The individual bins can be written out separately in the RPFITS ﬁle format to

enable investigation of pulse phase dep endent eﬀects, or can be ﬁltered within the correlator

based on a priori pulse proﬁle information.

To calculate which phase bin a visibility at a given frequency and time corresponds

to, the software correlator requires information on the pulsar’s ephemeris, which is supplied

in the form of one o r more “p olyco” ﬁles containing a polynomial description of apparent

pulse phase as a function o f time. These are generated using the pulsar analysis program

TEMPO

, and require prior timing of a pulsar. Additional software has been written by

the authors to verify the pulsar timing, using the generated polyco ﬁles and the baseband

data (in MkV, LBA or K5 format) fr om an experiment, allowing phase bins to be accurately

http://www.atnf.csiro .au/computing/software/rpﬁts.html

http://casa.nrao.edu/

http://www.atnf.csiro .au/computing/software/miriad

http://www.aoc.nrao.edu/aips/FITS-IDI.html

http://pulsar .princeton.edu/tempo/reference

manual.html

– 13 –

set before correlation.

For VLBI observations of pulsars, it is usually desirable to maximise the signal to noise

of the observations by binning the visibilities based on the pulse phase, and applying a ﬁlter

to the binned output based on the signal strength in tha t phase. Typically this ﬁlter is

implemented as a binary on/oﬀ for each phase bin. Using the pulse proﬁle generated from

the baseband data of an observation, however, DiFX allows a user-speciﬁed number of bins to

be g enerated and a ﬁlter applied ba sed on pulse strength × bin width, allowing the maximum

theoretical retrieval of signal, as described below. This also reduces the output data volume,

since only an “integrated on-pulse” visibility is retained, rather than potentially many phase

bins.

Consider observing a single pulse, divided into M equally spaced phase bins. Let the

pulsar signal strength as a function of phase bin be S(m), and the noise in single phase bin

to be Z ×

√

M, where Z is the baseline sensitivity f or an integration time of a single pulse

period. When all bins ar e summed (eﬀectively no binning), the S/N ratio will be:

m=0

S(m)

(1)

as the signal adds coherently while the noise adds in quadrature. For a simple on/oﬀ gate

accepting only bins m

to m

, the S/N ratio will be:

m=m

S(m)

m=m



Z ×

(M)



(2)

Finally, for the case where each bin is weighted by the pulse signal strength in that bin,

the S/N ratio will be:

m=0

(S(m))

m=0



S(m) × Z ×

(M)



(3)

For a Gaussian shaped pulse, this allows a modest improvement in recovered signal to

noise of 6% compared to an optimally placed single on/oﬀ bin. On a more complicated

proﬁle, such as a Gaussian main pulse with a Gaussian interpulse at half the amplitude, the

improvement in recovered signal to noise increases to 21 %.

– 14 –

3. Software cor relation on the Swinburne Beowulf cluster - a case study

3.1. The cluster computing environment

The Swinburne University of Technology supercomputer is a ∼300 processor Beowulf

cluster, that is a mixture of commodity oﬀ-the-shelf desktop and server style PCs, connected

via a gig abit ethernet network. In particular, the supercomputer has ﬁve sub-clusters, each

with 48 machines. Four sub-clusters are made up of single processor 3.2 GHz, Pentium 4 PCs

with 1 GB of RAM per machine, while one sub-cluster is made up of dual processor Xeon

servers, each with 2 GB of RAM per machine. The cluster is continuously upgraded and

fully replaced approximately every 3–4 years. The software correlation code must operate in

this multi-user, multi-tasking, and highly dynamic environment.

3.2. Structure of the DiFX code

DiFX is written in C++, but makes heavy use of the optimised vector processing routines

provided by the Intel Perfor mance Primitive (IPP) library

. The use of this o ptimised vector

library results in a factor of several performance gain on the Intel CPUs, compared to non-

optimised vector code. Data transfer is handled via the Message Passing Interface (MPI)

standard

. The mpich implementation of MPI is used

Figure 1 shows the high-level class structure of DiFX, along with the data ﬂow. The

correlation is managed by a master node (FxManager), which instructs da ta management

nodes (Datastream) to send time ranges of baseband data to processing nodes (Core). The

data are then processed by the Core nodes, and the results sent back to the FxManager.

Double buﬀered, non-blocking communication is used to avoid latency delays and maximise

throughtput. Both the Datastream and Core classes can be (and have been) extended to

allow maximum code re-use when handling diﬀerent data formats and processing algorithms.

The Core nodes make use of an allocatable number of threads to maximise performance on

a heterogenous cluster.

The Datastream nodes can read the baseband data into their memory buﬀers from a local

disk, a networ k disk or a network socket. Once the data are loaded into the datastream buﬀer,

the remainder of the system is unaware of its origin. This is one of the most powerful aspects

http://www.intel.com/cd/softwa re/products/asmo-na/eng/perﬂib/ipp/index.htm

http://www-unix.mcs.anl.gov/mpi/

http://www-unix.mcs.anl.gov/mpi/mpich1/

– 15 –

of this correlator architecture, meaning the same correlator can easily be used for production

disk-based VLBI correlation and real-time eVLBI t esting, where the data is transmitted in

real time from the telescopes to the correlator over optical ﬁbre. Real-time eVLBI operational

modes have been tested using DiFX, transmitting data in real-time from the three ATNF

telescopes (Parkes, ATCA, and Mopra) to computing resources at the Swinburne University

of Technology and the University of Western Australia in Perth (a Cray XD-1 utilising

Opteron processors and on-board Xilinx FPGAs). The software correlator then correlates

the transmitted data in real-time. A full account of the new eVLBI capabilities of the

Australian VLBI array will be presented elsewhere (Phillips et al. 2007, in preparation).

3.3. Operating DiFX

DiFX is controlled via an interactive Graphical User Interface (GUI), which calls the

various component programs and helper scripts. The primary purpose of the GUI is to

facilitate easy editing of the text ﬁles which conﬁgure t he correlator, run external programs

such as the delay model generator, and provide feedback while a job is running. Two ﬁles

are necessary to run the actual correlator program. The ﬁrst is an experiment conﬁguration

ﬁle, containing tables of stations, frequency setups, etc, analo gous to a typical hardware

correlator job conﬁguration script. The second ﬁle contains the list of compute nodes on

which the correlator program will run.

While it is po ssible to run all tasks required to operate the correlator manually, in prac-

tise they are orga nised via the GUI. This consists of running a series of helper applications

from the GUI to generate the necessary input for the correlator. These include a script to

extract experiment information from the VLBI exchange (VEX) ﬁle used to conﬁgure and

schedule the telescopes at observe t ime, a delay and (u,v,w) generator which makes use of

CALC 9, and scripts to extract the current load of available nodes. Pulsar–speciﬁc info r ma-

tion such as pulse proﬁles and bin settings can also be loaded. This information is presented

via the GUI and adjustments to the conﬁguration, such as selection computational resources

to be used, can be made before launching a correlation jo b.

In the future it is planned to incorporate some real-time feedback of a mplitude, phase

and lag information from the current correlation via the GUI. This would be similar to the

visibility spectra displays available continuously at connected-element int erferometers.

– 16 –

3.4. Performance

In order to keep every compute node used in the correlation fully lo aded, they must be

kept supplied with raw data. If this condition is satisﬁed, we have a CPU-limited correlation,

and the addition o f further nodes will result in a linear performance gain. In practise,

however, at some point o bta ining data from the data source (network socket or disk) and

transmitting it across the local network to the processing nodes will no longer occur quickly

enough, and the correlation becomes data-limited rather than CPU-limited. Correct selection

of correlation parameters, and good cluster design, will minimise the networking overhead

imposed on a correlation job, and ensure that all compute nodes are fully utilised. This is

discussed in §3.4.1 below, and performance proﬁles for the CPU-limited case are presented

in §3.4.2.

3.4.1. Networking considerations

As described in §3.2, double-buﬀered communications to the processing nodes are used

to ensure that nodes ar e never idle as long as suﬃcient aggregate networ king capability is

available. The use of MPI communications adds a small but unavoidable overhead to data

transfer, meaning the maximum throughput of the system is slightly less than the maximum

network capacity on the most heavily loaded data path.

There ar e two signiﬁcant data ﬂows: out of each Datastream and into the FxManager.

For any high speed correlation, there will be more Core nodes than Datastream nodes, so

the aggregate rate into a Core will be lower than that out of a Data stream. The ﬂow out of

a Core is a factor of N

cores

times lower than that into the FxManager node.

If processing in real time (when processing time equals o bservation time), the rate

out of each Datastream will be equal to the recording rate, which can be up to 1 Gbps with

modern VLBI arrays and is within the capabilities o f modern commodity ethernet equipment.

The rate into the FxManager node will be equal to the product of the recording rate, the

compression ratio, and the number of Cores, where the compression ratio is the ratio of data

int o a Core to data out of a Core. This is determined by the number of antennas (since

number of baselines scales with number of antennas squared), the number of channels in

the output cross-power spectrum, the number of polarisation products correlated, and the

int egration t ime used before sending data back to t he FxManager node.

It is clearly desirable to maximise the size of data messages sent to a core for processing,

since this minimises the data rate into the FxManager node for a given number of Cores.

However, if the messages are too large, performance will suﬀer as RAM capacity is exceeded.

– 17 –

Network latency may also become problematic, even with buﬀering. Furthermore, it should

be apparent tha t in this architecture, the Cores act as short-term accumulators (STAs),

with the manager performing the long term accumulation. The length of the STA sets

the minimum integration time. It is important to note, however, that the STA interval is

entirely conﬁgurable in the software correlator, to be as short as a single FFT, although

network bandwidth and latency are likely to be limiting factors in this case.

For the majority of experiments it is possible to set a STA length which satisﬁes all the

network criteria and allows the Cores to be maximally utilised. For combinations of large

numbers of antennas and very high spectral and time resolution, however, it is impossible

to set an STA which allows a satisfactorily low return data rate to the FxManager node. In

this case, real time processing of the experiment is not possible without the installation of

additional network and/or CPU capacity on the FxManager node.

It is important to emphasise that although it is possible to ﬁnd experimental conﬁgura-

tions for which the software correlator suﬀers a reduction in perfo r mance, these conﬁgurations

wo uld be impossible on existing hardware correlators. If communication to the FxManager

node is limiting perfo r mance, it is also possible to parallelize a disk-based experiment by

dividing an experiment into several time ranges and processing these time ranges simultane-

ously, allowing an aggregate processing rate which equals real time. This is actually one o f

the most powerful aspects of the software correlator, and one which would allow scheduling

of correlation to always ensure the cluster was being fully utilised.

3.4.2. CPU-limited performance

Figure 2 shows the results of performance testing on the Swinburne cluster (using the

3.2 GHz Pent ium 4 machines and the giga bit ethernet network) for diﬀerent array sizes and

spectral resolutions. The results shown in Figure 2 were obtained for data for which the

aggregate bandwidth was 64 MHz, broken up into 8 bands each of 8 MHz bandwidth (4 ×

dual polarisation 8 MHz bands: data were 2-bit sampled: antenna data rate 256 Mbps).

Node requirements for real-time operation are extrapolated from the compute time on an

8 node cluster. The correlation integration time is 1 second and all correlations provide all

four polarisation products. RAM requirements per node ranged from 10 – 50 MB depending

on spectral resolution, showing that large amounts of RAM are unnecessary for typical

correlations. It can be seen that even a modestly sized commodity cluster can process a

VLBI-sized arr ay in real time at currently ava ila ble data rates.

– 18 –

3.5. Correlator comparison results

3.5.1. Comparison with ATNF S2 correlator

Observations to provide data for a correlator comparison between the Swinburne soft-

wa r e correlator and the ATNF S2 correlator were undertaken on March 12, 2006, with the

following subset of the LBA: Parkes (64 m), ATCA (phased array of 5 × 22 m), Mopra (22

m), Hobart (26 m).

Data from these observations were recorded simultaneously to S2 tapes and the L BADR

disks (Phillips et al. 2 007, in preparation) during a 20 minute period, UT 02:30–02:50, cor-

responding to a scan on a bright quasar (PKS 0208−512). The data recorded corresponded

to two 16 MHz bands, right circular pola r isation (RCP), in the f requency r anges 2252 −

2268 MHz and 2268 − 2284 MHz.

The data recorded on S2 tapes were shipped to the ATNF LBA S2 correlator (Roberts

1997) a t ATNF headquarters and processed. The data recorded to LBADR disks were

shipped t o the Swinburne University of Technology supercomputer and processed using the

software correlator.

At both correlators ident ical T

sys

values in Jy were speciﬁed for each antenna and applied

in order to produce nominally calibrated visibility amplitudes. Further, both correlators used

identical clock models, in the form of a single clock oﬀset and linear rate as a function of time

per antenna. Finally, the data were processed at each correlator using 2 second correlator

int egration t imes and 32 spectral channels across each 16 MHz band.

Diﬀerent implementations of the CALC-based delay generation were used a t each corre-

lator, meaning small diﬀerences exist in the delay models used, leading t o diﬀerences in the

correlated visibility phase. We have calculated the delay mo del diﬀerences and subtracted

the phase due to diﬀerential delay model in the following discussion.

From both correlators, RPFITS fo r mat data were output and loaded into the MIRIAD

software (Sault, Teuben, & Wright 1995) for insp ection and analysis. The data from the two

correlators are compared in a series of F ig ures below (Figures 3 – 5).

Figure 3 shows the visibility amplitudes f or all baselines from both correlators as a

function of time, over the period 02:36:00 - 02:45:00 UT, for one of the 16 MHz bands (2252

− 2268 MHz). These amplitudes represent the vector averaged data over the frequency

channel range 10 − 21 (to avoid the edges of the band). The data for each baseline were

ﬁt to a ﬁrst order polynomial model (S(t) =

t + S

, where S is the ﬂux density in Jy, t

is the o ﬀ set in seconds from UT 02 :4 0:30, and S

is the extrapolated ﬂux density at time

– 19 –

UT 02:40:30, using a standard linear least squares routine. The root mean square (RMS)

variation around the best ﬁt model was calculated for each baseline. The ﬁtted models are

shown in Figure 3 and show no signiﬁcant diﬀerences between the S2 correlator and the

software correlator. Further, the calculated RMS for each baseline agrees very well between

DiFX and the S2 correlator, as summarised in Table 3.

Figure 4 shows the visibility phase as a function of time for each of the six baselines in

the array. Again the data represent the vector averaged correlator output over the frequency

channel range 10 − 21 within the 2252 − 2268 MHz band. As discussed above, small

diﬀerences between the delay models used at each correlator have been taken into account

as part of this comparison.

Figure 5 shows a comparison of the visibility amplitudes and phases as a function of

frequency in the 2252 − 2268 MHz band. The data represented here result from a vector

average of the two datasets over a two minute time range, UT 02:40:00 − 02:42:00. Since

the S2 correlator is an XF – style correlator, it cannot exactly correct fractional sample error

in the same manner as an FX correlator such as DiFX, as the channelisation is performed

after accumulatio n. The coarse (post-accumulation) fractional sample correction leads to

decorrelation at all points except the band center, up to a maximum of ∼ 10% at the band

edges on long baselines where the geometric delay changes by a sample or more over an

int egration period. We have corrected for this band edge decorrelation in the S2 correlator

amplitudes in Figure 5.

3.5.2. Comparison with the VLBA correlator

Data obtained as part of a regular series of VLBA test observations were used as a

basis for a correlator comparison between the software correlator and the VLBA correlator

(Napier et al. 1994). The observations were made on 2006 August 05 using the Brewster,

Los Alamos, Mauna Kea, Owens Valley, Pie Town, and Saint Croix VLBA stations. One bit

digitised data sampled at the Nyquist rate for four dual polarisation bands, each of 8 MHz

bandwidth, were recorded using the Mk5 system (Whitney 2003). The four bands were at

centre frequencies of 22 79.49, 2287.49, 2295.49, and 2303.49 MHz. The exp eriment code for

the observations was MT628 and the source observed was 092 3+392, a strong and compact

active galactic nucleus. Approximately two minutes of data recorded in this way was used

for the comparison.

The Mk5 data were correlated on the VLBA correlator and exported to FITS f ormat

ﬁles. The data were also shipped to the Swinburne supercomputer and correlated using the

– 20 –

software correlator, the correlated data exported to RPFITS format ﬁles. In both cases, no

scaling of the correlated visibility amplitudes by the system temperatures were made at the

correlators. The visibilities remained in the form of correlation coeﬃcients for the purposes

of the comparison i.e. a system temperature of unity was used to scale the a mplitudes. Each

8 MHz band was correlated with 64 spectral points, and an integration time of 2.048 seconds

wa s used.

The VLBA correlator data were read into AIPS using FITLD with the parameter

DIGICOR=1. The DIGICOR parameter is used to apply certain scalings to the visibil-

ity amplitudes for data from the VLBA correlato r . Further, to obtain the most accurate

scaling of the visibility amplitudes, the task ACCOR was used to correct for imperfect sam-

pler thresholds, deriving corrections to the antenna-based a mplitudes of ∼ 0.5%. These

ACCOR corrections were applied to the data and the data were written to disk in FITS

format.

The software correlator data were read directly into AIPS and then written to disk in

the same FITS format as the VLBA correlator data. No corrections to amplitude or phase

of the software correlated data were made in AIPS.

The VLBA correlator data and the software correlator data were b oth imported into

MIRIAD for inspection and analysis, using the same software as used for the comparison

with the LBA correlator described above. RCP from the 2283.49 – 2291.4 9 MHz band over

the time range UT 17:49:00 − 17:51:00 was used in all comparison plots below.

Since the delay models used by the VLBA and software correlators diﬀer at the pi-

cosecond level, as is the case for the comparison with the LBA data in §3.5.1, diﬀerences in

the visibility phase exist between the correlated datasets. As with the LBA comparison, we

have compensated for the phase error due to the delay models diﬀerences in the following

comparison.

Figure 6 shows the visibility amplitudes f or all baselines from both correlators as a

function of time. These amplitudes represent the vector averaged data over the frequency

channel range 10 − 55 (to avoid the edges of the band). The data for each baseline were ﬁt to

a ﬁrst order polynomial model (S(t) =

t+S

, where S is the correlation coeﬃcient, t is the

oﬀset in seconds from UT 17:50:00, and S

is the extrapolated correlation coeﬃcient at time

UT 17:5 0:00) using a standard linear least squares routine. The root mean square (RMS)

variation around the b est ﬁt model was calculated for each baseline. The ﬁtted models

are shown in Figure 6 and show no signiﬁcant diﬀerences between the VLBA correlator

and the software correlator. Further, the calculated RMS for each baseline agrees very well

between the VLBA correlator and the software correlator. The results of the comparison are

– 21 –

summarised in Table 4.

Figure 7 shows the visibility phase as a function of time fo r each of the ﬁfteen baselines in

the array. Again the data represent the vector averaged correlator output over the frequency

channel range 1 0 − 55 within the band. As discussed above, small diﬀerences between the

delay models used at each correlator cause phase oﬀsets between the two correlators, and

have been taken into account as part of this comparison.

Figure 8 shows a comparison of the visibility amplitudes and phases as a function of

frequency in the band. The data represented here result from a vector average of the two

datasets over a two minute time ra nge. Figures 6, 7 and 8 show that the results obtained

by the VLBA correlator and DiFX agree to within the RMS errors of the visibilities in each

case, as expected.

4. Scient iﬁc applications of t he Swinburne software correlator

4.1. High frequency resolution spectral line VLBI

As mentioned in the introduction, an attractive feature of software correlation is the ease

with which very high spectral resolution correlation can be undertaken. This is particularly

useful for studies of spectral line sources such as masers when mapping the distribution of the

masing regions and their kinematics i.e. near black holes in galactic nuclei ( Greenhill et al.

1995).

Figure 9 shows a spectrum obtained from an LBA observation of the OH maser G345−0.2.

These observations were made with an array consisting of the ATCA (phased array of 5 × 22

m), Parkes (64 m), and Mopra (22 m), recording data from a dual-polarised (RCP and LCP)

4 MHz band onto hard disk. The data were correlated using the software correlato r with

16,384 frequency channels across the 4 MHz band, corresponding to 0.25 kHz per channel

or 0.038 km/s velocity resolution at 1 .7 2 GHz.

These results compare with recent very high spectral resolution work done with the

VLBA. Fish et a l. (2006) o bserved OH masers with the VLBA, using a 62.5 kHz bandwidth

and 512 channels across this band to obtain channel widths of 0.122 kHz or 0.02 km/s velocity

resolution. The velocity resolution of this correlated dataset is almost twice as good as that

shown in Figure 9 . However, the VLBA bandwidth is only 0.016 times the bandwidth of the

observations shown in Figure 9.

If required, DiFX could have correlated these data with 32,768 channels, 65,536 channels

or even higher numbers of channels. As mentioned in the introduction, the only penalty is

– 22 –

compute time on a resource with a ﬁxed number of processing elements. DiFX therefore has

a clear advantage over existing hardware correlators in terms of producing very high spectral

resolution over wide bandwidths. This capability is useful if the velocity distribution of an

ensemble of masers in a ﬁeld is broa d and cannot be contained in a single narrow bandwidth.

4.2. Correlation for wide ﬁelds of view

An application t hat takes advantage of the frequency and time resolution of the software

correlator output is wide ﬁeld imaging. To image a wide ﬁeld of view, avoiding the eﬀects

of time and bandwidth smearing, high spectral and temporal resolution is required in the

correlator visibility output. For example, at VLBI resolution (40 mas), to image the full

primary beam of an Australia Telescope Compact Array (ATCA) antenna (22 m diameter)

at a frequency of 1.4 GHz, requires a time resolution of the correlator output of 50 ms and a

frequency resolution of 4 kHz (allowing a 0.75 % smearing loss at the F WHM of the primary

beam).

Neither the JIVE nor the VLBA har dware correlators can achieve such high frequency

or t ime resolution for continuum experiments, but DiFX can b e conﬁgured for such modes

in an identical manner to a normal continuum experiment.

4.3. Pulsar studies

As compact sources with high velocities, pulsars make excellent testb eds with which

to pro be the structure of the interstellar medium (ISM). Scintillation due to structure in a

scattering screen between the observer and the pulsar causes va r ia t io ns in the interferometric

visibilities, which have some dependence on time and frequency (e.g. Hewish et al. 1985).

Naturally, pulsar binning is advantageous in these studies for maximising signal to noise

ratios.

The most stringent requirement for useful studies of pulsar scintillation, however, is that

of extremely high frequency resolution. Brisken et a l. (2007, in preparation) have recently

demonstrated the capabilities of DiFX for this type of analysis with observations of the

pulsar B0834−04. The NRAO Green Bank Telescope (100 m), Westerbork (14 × 25 m),

Jodrell Ba nk (76 m), and Arecibo (305 m) were used to provide an ultra-sensitive array at

327 MHz. The data were recorded using the Mk5 system and correlated on the Swinburne

software correlator. The main requirement on the correlation was 0.25 kHz wide frequency

channels, over the broadest bandwidth available, to maximise signal to noise. For these

– 23 –

observations a 32 MHz band wa s available. The Swinburne software correlator therefore

correlated the data with 131,072 frequency channels across the band.

No existing hardware correlator can provide such a high frequency resolution over such a

wide bandwidth. Full details of the interpretation o f the B0834 −04 software correlated data

will be ava ila ble in Brisken et al. (2007, in preparation). Shown in Figure 10 is a section

of the dynamic spectrum from this observatio n which shows the scintillation structure as

functions of time and frequency.

4.4. Geodetic VLBI

In addition to astronomical VLBI, the software correlator can also be deployed for

geodetic VLBI. Compared to astronomical VLBI, geodetic VLBI has additional requirements,

including diﬀerent output formats a nd the frequent use of sub-arraying. The ﬂexibility and

capabilities of the software correlator ar e well-matched to this ta sk.

The software correlator has been tested on geodetic datasets obtained using the Mk5

recording system, consisting of 16 frequency bands. These tests form the basis of a geodetic

correlation comparison between the software correlator and the geodetic correlator of the

Max Plank Institut of Radioastronomie in Bonn, Germany. Full results of this correlator

comparison will be reported elsewhere (Tingay et al. 2007, in preparation).

In particular, in Australia a new three-station geodetic VLBI array has been funded

as part of the geospatial component of the Federal Government’s National Collab orative

Research Infrastructure Scheme (NCRIS). This scheme provides for three new g eodetic VLBI

stations of 12 m diameter, Mk5 recording systems, and a modiﬁed version of t he software

correlator described in this paper. The modiﬁcations necessary to convert DiFX into a

geodetic correlato r consist o f the addition of phase calibration tone extraction, a streamlined

int erfa ce to scan-by-scan correlation for sub-arraying, and a capability to produce visibilities

in a format convenient for geodetic post-processing.

The new Australian geodetic VLBI array will participate in global geodetic observations,

as well as undertaking experiments internal to the Australian tectonic plate.

5. Conclusions

In this paper we have outlined the main beneﬁts of software correlation for small to

medium sized VLBI arrays. They are:

– 24 –

• The development of software correlation is rapid and does not depend on an intimate

knowledge of digital signal processing har dware, just the algorithms;

• The software is ﬂexible and scalable to accommo dat e a very broad ra nge of int erfero-

metric modes of observation, including many which cannot be supported by existing

ASIC-based hardware correlators. Software correlators are therefore ideal for novel

experiments with very special requirements. The main trade-oﬀ for improved perfor-

mance with a software correlator is the increase in compute time for a ﬁxed number of

processing elements, or the addition of extra processing elements;

• The software can easily incorporate data recorded using mixed disk-based recording

hardwar e;

• Medium to large multi-processor computing facilities are available at almost all uni-

versity and government research institutions, allowing users easy entry into VLBI cor-

relation;

• The correlation algorithm is highly parallel and very well suited to a parallel multi-

processor computing environment;

• The cost of commodity computing continues to fall with time, making large parallel

computing facilities more powerful and less expensive;

• Once written, the code can be port ed to a wide r ange of platforms and recompiled

with minimal eﬀort.

We have discussed the implementation of the DiFX software correlator on a standard

Beowulf cluster at the Swinburne University of Technology and have provided performance

ﬁgures-of-merit for this implementation, showing that relatively large numbers of telescopes

and relatively high data r ates can be correlated in “ real-time” using numbers of machines

that do no t exceed the capabilities of moderate to large Beowulf clusters. Clear trade-oﬀs are

possible in many areas of performance. For example, if r eal-time operation is not important

it is possible to dramatically reduce the number of processing elements.

We have also showed the results of comprehensive testing of the software correlator,

comparing it output to that of two established hardware correlators, the S2 correlator of

the Australian Long Baseline Array, operated by the ATNF, and the VLBA correlator. The

correlator comparisons of visibility amplitude and phase as functions of time and frequency

verify that DiFX is operating correctly for astronomical VLBI observations.

DiFX now supports all Australian VLBI observations and some global VLBI experi-

ments, at dat a rates up to 1 Gbps per telescope. The DiFX code can be downloaded from

– 25 –

http://astronomy.swin.edu.au/~adeller/software/difx/. A number of scientiﬁc pro-

grams have already been supported by the software correlator and are brieﬂy discussed here.

Further, a modiﬁed version o f the software correlator will be used to support a new VLBI

array in Australia, dedicated to local and global geodetic observations.

This work has been supported by the Australian Federal Government’s Major National

Research Facilities pro gram, the Australian Research Council’s (ARC) Strategic Research

Initiatives (eResearch) scheme, and the ARC’s Discovery Projects scheme. ATD is supported

via a Swinburne University of Technology Chancellor’s Research Scholarship and a CSIRO

postgraduate scholarship. The Long Baseline Array is part of the Australia Telescope which

is funded by the Commonwealth of Australia for operation as a National Facility managed

by CSIRO. The National Radio Astronomy Observatory is a facility of the National Science

Foundation operated under cooperative agr eement by Associated Universities, Inc. We wish

to thank Walter Brisken for kindly making available Figure 10 prior to publication, the

NRAO (Walter Brisken, Craig Walker, Jon Romney) for making available data for the VLBA

correlator comparison and Ga ry Scott for correlating the LBA S2 data for the comparison

with the ATNF S2 correlator.

REFERENCES

Bailes, M. 2003, ASP Conf. Ser. 302: Radio Pulsars, 302, 57

Bare, C. et al. 19 67, Science, 157, 18 9

Bellanger, M. & Daguet, J., IEEE Trans. Commun. Com-22(9), 11991205

Brisken, W. et al. 2007, in preparation

Carlson, B.R. et al. 19 99, PASP, 111, 1025

Casse, J.L. 1999, New Ast. Rev., 43, 503

Clark, B.G. 1973 , Proc. IEEE, 61, 1242

Clark, B.G., Cohen, M.H. & Jauncey, D.L. 1967 , ApJ, 149, L15 1

Cooper, B. F. C. 1970, Australian Journal of Physics, 23, 521

2000, Radio Int erferometry: The Saga and the Science,” Proceedings of a Symposium Hon-

oring Barry Clark at 60, ed. D. G. Finley & W. M. Goss, NRAO Workshop Number

27, Associated Universities Inc.

– 26 –

Harp, G.R. 2002, in Advanced Telescope and Instrumentatio n Control Software II. eds L.

Hilton. Proceedings of the SPIE, vol 4848, 1

Hewish, A., Wolszczan, A., & Graham, D. A. 1985, MNRAS, 213, 167

Horiuchi, S. et al. 2006, ApJS(submitted)

Horiuchi, S. et al. 2000, Advances in Space Research, 26, 625

Greenhill, L.J. et al. 1995, ApJ, 440, 619

Kondo, T. et al. 2003, in New technologies in VLBI, ASP Conf. Ser., Vol. 306. ed Y.C. Minh,

San Francisco, CA: Astronomical Society of the Paciﬁc

Moran, J.M. et al. 19 67, Science, 157, 67 6

Napier, P.J., Bagri, D.S., Clark, B.G., R ogers, A.E.E., Romney J.D., Thompson, A.R. &

Walker, R.C., Proc. IEEE, 82, 658

Phillips, C. et al. 2007, in preparation

Pogrebenko, S. 2003, in Workshop on Planetary Probe Atmospheric Entry and descent

Traj ectory Analysis and Science, ed A. Wilson, ESA

Roberts, P.P. 1997, Astron. Astrophys. Suppl. Ser., 126 , 379

Rogers, A.E.E. et al. 1983, Science, 219, 51

Romney, J. D. 1999, ASP Conf. Ser. 180: Synthesis Imaging in Radio Astronomy II, 18 0, 57

Sault, R.J., Teuben, P.J. & Wright, M.C.H. 1995, ASPC, 77, 433

Tingay, S.J. et al 20 07, in preparation

Thompson, A.R., Moran, J.M. & Swenson, G.W. 1994, Interferometry and Synthesis in

Radio Astronomy, Kreiger Publishing Company

Thompson, A. R. 199 9, ASP Conf. Ser. 180: Synthesis Imaging in Radio Astronomy II, 180,

Voˆute, J.L.L. et al. 2002, A&A, 385, 73 3

West, C. 2004, M.Sc. thesis, Swinburne University o f Technology

Whitney, A.R. 2003, ASPC, 306, 123

– 27 –

Whitney, A.R. 2 002, in Proceedings of the 6t h European VLBI Network Symposium, eds.

Ros, E., Porcas, R.W., L obanov, A.P., & Zensus, J.A., 41

Whitney, A.R. 1993, in IAU Symp. 156: Developments in Astrometry and their Impact on

Astrophysics and Geodynamics, 151

Wietfeldt, R. et al. 1996, IEEE Transactions on Instrumentation and Measurement, 45(6),

923

Wilson, W., Roberts, P., Davis, E. 1996, in Proceedings of the 4th APT Workshop, ed E.

King, 16

This preprint was prepared with the AAS L

X macros v5.2.

– 28 –

Table 1. Comparison of existing hardware correlator parameters

Correlator Type Maximum telescopes Maximum channels Minimum integration time Maximum input data rate Maximum output data rate Pulsar binning

(in one correlator pass) (per baseline) (ms) (Mbps) (MB/s)

VLBA

FX 20 2048 131.072 256 1 yes

JIVE

XF 16 2048

125

1024 6

ATNF S2

XF 6 8192

2000 128 0.064 yes

http://www.vlba.nrao.edu/astro/obstatus/current/node28.html

http://www.jive.nl/correlator/status.html

for up to 8 telescopes

when using half the correlator

data in lag space

http://www.atnf.csiro.au/vlbi/correlator/

0.5 MHz bandwidth, 2 products

– 29 –

Table 2. Maximum decorrelation incurred due to “Post-F” fringe rotation

Observation Max. baseline Frequency # channels/16MHz band Max. decorrelation

(km) (MHz) (%)

LBA low frequency continuum 1400 1600 128 0.003

LBA high frequency continuum 1700 8400 128 0.13

VLBA low frequency continuum 8600 1600 128 0.12

VLBA high frequency continuum 8600 22200 128 21.1

LBA water masers 1700 22200 1024 47.6

– 30 –

Table 3. Linear ﬁt parameters for visibility amplitude vs time for DiFX and the LBA S2

correlator, with 95% conﬁdence limits

Baseline Oﬀset

DiFX

(Jy) Oﬀset

LBA

(Jy) Slope

DiFX

(µJy s

−1

) Slope

LBA

(µJy s

−1

)

PKS - NAR 1.341 ± 0.030 1.343 ± 0.028 10 ± 13 14 ± 12

PKS - MOP 3.185 ± 0.058 3.185 ± 0.063 14 ±24 −11 ± 26

PKS - HOB 2.307 ± 0.058 2.293 ± 0.061 −12 ± 24 − 6 ± 24

NAR - MOP 1.616 ± 0.109 1.619 ± 0.114 −27 ± 43 −10 ± 45

NAR - HOB 1.142 ± 0.111 1.139 ± 0.116 − 3 ± 44 − 5 ± 46

MOP - HOB 2.694 ± 0.256 2.681 ± 0.257 18 ± 101 56 ± 101

– 31 –

Table 4. Linear ﬁt parameters for visibility amplitude (in units of correlation coeﬃcient)

vs time for DiFX and the VLBA correlator , with 95% conﬁdence limits

Baseline Oﬀset

DiFX

Oﬀset

VLBA

Slope

DiFX

−1

× 10

−6

) Slope

VLBA

−1

× 10

−6

)

BR - LA 0.0104 ± 0.0004 0.0103 ± 0.0005 −0.8 ± 1.7 −0.9 ± 1.7

BR - MK 0.0072 ± 0.0005 0.0071 ± 0.0006 0.1 ± 1.8 0.5 ± 2.0

BR - OV 0.0125 ± 0.0005 0.0124 ± 0.0005 −0.7 ± 1.7 −0.5 ± 1.8

BR - PT 0.0090 ± 0.0004 0.0089 ± 0.0004 −1.0 ± 1.3 −1.2 ± 1.5

BR - SC 0.0069 ± 0.0005 0.0069 ± 0.0005 −3.1 ± 2.0 −2.5 ± 1.8

LA - MK 0.0059 ± 0.0005 0.0059 ± 0.0005 1.9 ± 1.7 1.4 ± 1.7

LA - OV 0.0101 ± 0.0005 0.0100 ± 0.0005 0.4 ± 1.7 0.6 ± 1.7

LA - PT 0.0073 ± 0.0005 0.0 072 ± 0.0005 −0.3 ± 1.7 −0.5 ± 1.8

LA - SC 0.0058 ± 0.0004 0.0058 ± 0.0004 −1.8 ± 1.5 −1.9 ± 1.5

MK - OV 0.0078 ±0.0004 0.0077 ± 0.0005 0.9 ± 1.5 0.3 ± 1.8

MK - PT 0.0044 ± 0.0004 0 .0 044 ± 0.0004 −0.6 ± 1.7 −0.3 ± 1.5

MK - SC 0.0028 ± 0.0005 0.0028 ± 0.0005 −0.6 ± 1.8 −0.7 ± 1.7

OV - PT 0.008 3 ± 0.0005 0.0082 ± 0 .0 005 −1.8 ± 1.8 −1.9 ± 1.7

OV - SC 0.0062 ± 0.0005 0.0062 ± 0.0005 −0.3 ± 1.8 −0.2 ± 1.8

PT - SC 0.0055 ± 0.0005 0.0055 ± 0.0005 −1.7 ± 2.0 −1.3 ± 1.8

– 32 –

Fig. 1.— Overview of the software correlator architecture. Data is loaded into memory from

a disk or network connection by Datastream nodes. These nodes are directed by a Master

node to send data from given time r anges (typically several ms) to the processing elements

(Core nodes). The processed data are sent to the master node for long-term accumulation

and storage on disk.

– 33 –

Fig. 2.— Benchmark data showing the computational requirements of DiFX to correlate in

real-time, as described in the text. The nodes are single core 3.2 G Hz Pent ium processors

with 1 GB RAM, and in both benchmarks 64 MHz of total bandwidth per station was

correlated with a 1 second integration period. Top panel shows the scaling of computational

requirements with number of antenna, using 256 spectral po ints per 8 MHz subband. Bottom

panel shows the scaling o f computional r equirements with spectral points per subbband for

a ten station array.

– 34 –

Fig. 3.— S2 (red) and DiFX (black) visibility amplitude vs time for the 2252 – 2268 MHz

band on the source PKS 0208−512, as described in the text (PKS = Parkes; MOP = Mopra;

HOB = Hobart; NAR = ATCA). Symbols represent the actual visibilities produced by the

correlators, while the lines represent linear least-squares ﬁts to the visibilities (one line per

dataset).

– 35 –

Fig. 4.— S2 (red) and DiFX (black) visibility phase vs time for the 2252 – 2268 MHz band

on the source PKS 0208−512, a s described in the text. Antenna labels as in Figure 2 above.

The PKS-NAR baseline has been shifted by −50 deg for clarity.

– 36 –

Fig. 5.— S2 (red) and DiFX (black) visibility amplitude and phase vs frequency data for

the 2252 – 2268 MHz band on the source PKS 0208−512, as described in the text. Antenna

labels as in Figure 2 above. The S2 data has been corrected for fractional-sample error

decorrelation at the band edges as described in the text.

– 37 –

Fig. 6.— VLBA correlator (red) and DiFX (black) visibility amplitude vs time for the

2283.49 – 2291.49 RCP band from the VLBA test observation MT628, as described in the

text. The units of time are seconds from UT 00:00:00, and the amplitude scale is correlation

coeﬃcient. Symbols represent the actual visibilities produced by the correlators, while the

lines represent linear least-squares ﬁts to the visibilities. The text annotation on each panel

lists t he average correlation coeﬃcient amplitude for each correlator over the time period,

as ta bulated in Table 4.

– 38 –

Fig. 7.— VLBA correlator (red) and DiFX (black) visibility phase vs time for the 2283.49 –

2291.49 RCP band from the VLBA test observation MT628, as described in the text. The

units of time are seconds from UT 00:00:00, a nd phase is displayed in degrees.

– 39 –

Fig. 8.— VLBA correlator (red) and DiFX (black) visibility amplitude and phase as a

function of frequency fo r the 2283.49 – 2291.49 RCP band from the VLBA test observation

MT628, as described in the text. The vertical scale for correlation coeﬃcient amplitude on

each panel is 0 – 0.018, while the phase scale spans ±180 deg. The horizontal scale for each

panel displays channels 0–64.

– 40 –

Fig. 9.— A two minute average of the ATCA – Parkes cross-power spectrum taken f rom the

software correlated data for t he OH maser G345−0.2, as described in the text. The velocity

resolution is 0.038 km/s at the central frequency of 1.72 GHz. The light gray line showing

strong maser emission represents the LCP data and the dark gray line with little emission

represents the RCP data. The maser is highly circularly polarised.

– 41 –

Fig. 10.— The cross-power dynamic spectrum showing scintillation variations for the pulsar

B0834−04 on the Green Bank Telescope – Arecibo baseline. Brightness represents the visi-

bility amplitude and colour represents the visibility phase. Increasing frequency runs left to

right and increasing time runs top to bo t t om. This section of the dynamic spectrum repre-

sents just 5% of the time span and 0.5% of the bandwidth of the observation (330 seconds

and 1 60 kHz).

Swift J1727.8-1613 has the Largest Resolved Continuous Jet Ever Seen in an X-ray Binary

Preprint

Full-text available

May 2024

Multi-wavelength polarimetry and radio observations of Swift J1727.8-1613 at the beginning of its recent 2023 outburst suggested the presence of a bright compact jet aligned in the north-south direction, which could not be confirmed without high angular resolution images. Using the Very Long Baseline Array and the Long Baseline Array, we imaged Swift J1727.8-1613, during the hard/hard-intermediate state, revealing a bright core and a large, two-sided, asymmetrical, resolved jet. The jet extends in the north-south direction, at a position angle of $-0.60\pm0.07\deg$ East of North. At 8.4 GHz, the entire resolved jet structure is $\sim110 (d/2.7\,\text{kpc})/\sin i$ AU long, with the southern approaching jet extending $\sim80 (d/2.7\,\text{kpc})/\sin i$ AU from the core, where $d$ is the distance to the source and $i$ is the inclination of the jet axis to the line of sight. These images reveal the most resolved continuous X-ray binary jet, and possibly the most physically extended continuous X-ray binary jet ever observed. Based on the brightness ratio of the approaching and receding jets, we put a lower limit on the intrinsic jet speed of $\beta\geq0.27$ and an upper limit on the jet inclination of $i\leq74\deg$. In our first observation we also detected a rapidly fading discrete jet knot $66.89\pm0.04$ mas south of the core, with a proper motion of $0.66\pm0.05$ mas hour$^{-1}$, which we interpret as the result of a downstream internal shock or a jet-ISM interaction, as opposed to a transient relativistic jet launched at the beginning of the outburst.

On the Structure of the Sagittarius Spiral Arm in the Inner Milky Way

Preprint

Full-text available

May 2024

We report measurements of trigonometric parallax and proper motion for two 6.7 GHz methanol and two 22 GHz water masers located in the far portion of the Sagittarius spiral arm as part of the BeSSeL Survey. Distances for these sources are estimated from parallax measurements combined with 3-dimensional kinematic distances. The distances of G033.64$-$00.22, G035.57$-$00.03, G041.15$-$00.20, and G043.89$-$00.78 are $9.9\pm0.5$, $10.2\pm0.6$, $7.6\pm0.5$, and $7.5\pm0.3$ kpc, respectively. Based on these measurements, we suggest that the Sagittarius arm segment beyond about 8 kpc from the Sun in the first Galactic quadrant should be adjusted radially outward relative to previous models. This supports the suggestion of Xu et al. (2023) that the Sagittarius and Perseus spiral arms might merge in the first quadrant before spiraling inward to the far end of the Galactic bar.

On the Structure of the Sagittarius Spiral Arm in the Inner Milky Way

Article

Full-text available

May 2024

We report measurements of trigonometric parallax and proper motion for two 6.7 GHz methanol and two 22 GHz water masers located in the far portion of the Sagittarius spiral arm as part of the BeSSeL Survey. Distances for these sources are estimated from parallax measurements combined with three-dimensional kinematic distances. The distances of G033.64−00.22, G035.57−00.03, G041.15−00.20, and G043.89−00.78 are 9.9 ± 0.5, 10.2 ± 0.6, 7.6 ± 0.5, and 7.5 ± 0.3 kpc, respectively. Based on these measurements, we suggest that the Sagittarius arm segment beyond about 8 kpc from the Sun in the first Galactic quadrant should be adjusted radially outward relative to previous models. This supports the suggestion of Xu et al. that the Sagittarius and Perseus spiral arms might merge in the first quadrant before spiraling inward to the far end of the Galactic bar.

Shanghai Tianma Radio Telescope and Its Role in Pulsar Astronomy

Article

Full-text available

Apr 2024

After two phases of on-site construction and testing (2010–2013 and 2013–2017), the Shanghai Tianma Radio Telescope (TMRT) can work well, with efficiencies better than 50% from 1.3 to 50.0 GHz, mainly benefiting from its low-noise cryogenic receivers and active surface system. Pulsars were chosen as important targets of research at the TMRT because of their important scientific and applied values. To meet the demands of pulsar-related observations, TMRT is equipped with some necessary backends, including a digital backend system (DIBAS) supporting normal pulsar observation modes, a real-time fast-radio-burst-monitoring backend, and baseband backends for very-long-baseline interferometry (VLBI) observations. Utilizing its high sensitivity and simultaneous dual-frequency observation capacity, a sequence of pulsar research endeavors has been undertaken, such as long-term pulsar timing, magnetar monitoring, multi-frequency (or high-frequency) observations, interstellar scintillation, pulsar VLBI, etc. In this paper, we give a short introduction about pulsar observation systems at the TMRT and briefly review the results obtained by these pulsar research projects.

A Study of the Radio Spectrum of Mrk 421

Article

Full-text available

Mar 2024

We present the results of a spectral analysis using simultaneous multifrequency (22, 43, 86, and 129 GHz) very long baseline interferometry (VLBI) observations of the Korean VLBI Network on BL Lac object, Markarian 421. The data we used were obtained from 2013 January to 2018 June. The light curves showed several flux enhancements with global decreases. To separate the variable and quiescent components in the multifrequency light curves for milliarcsecond-scale emission regions, we assumed that the quiescent radiation comes from the emission regions radiating constant optically thin synchrotron emissions (i.e., a minimum flux density with an optically thin spectral index). The quiescent spectrum determined from the multifrequency light curves was subtracted from the total CLEAN flux density, yielding a variable component in the flux that produces the time-dependent spectrum. We found that the observed spectra were flat at 22–43 GHz, and relatively steep at 43–86 GHz, whereas the quiescent-corrected spectra are sometimes quite different from the observed spectra (e.g., sometimes inverted at 22–43 GHz). The quiescent-corrected spectral indices were much more variable than the observed spectral indices. This spectral investigation implies that the quiescent-spectrum correction can significantly affect the multifrequency spectral index of variable compact radio sources such as blazars. Therefore, the synchrotron self-absorption B-field strength ( B SSA ) can be significantly affected because B SSA is proportional to the fifth power of turnover frequency.

Toward Microarcsecond Astrometry for the Innermost Wobbling Jet of the BL Lacertae Object OJ 287

Article

Full-text available

Sep 2023

The BL Lacertae object OJ 287 is a very unusual quasar producing a wobbling radio jet and some double-peaked optical outbursts with a possible period of about 12 yr for more than one century. This variability is widely explained by models of binary supermassive black holes (SMBHs) or precessing jets/disks from a single SMBH. To enable an independent and nearly bias-free investigation on these possible scenarios, we explored the feasibility of extremely high-precision differential astrometry on its innermost restless jet at millimeter wavelengths. Through revisiting some existing radio surveys and very long baseline interferometry (VLBI) data at frequencies from 1.4 to 15.4 GHz and performing new Very Long Baseline Array observations at 43.2 GHz, we find that the radio source J0854+1959, 7.′1 apart from OJ 287 and with no clearly seen optical and infrared counterparts, could provide a nearly ideal reference point to track the complicated jet activity of OJ 287. The source J0854+1959 has a stable GHz-peaked radio spectrum and shows a jet structure consisting of two discrete, milliarcsecond-scale-compact and steep-spectrum components and showing no proper motion over about 8 yr. The stable VLBI structure can be interpreted by an episodic, optically thin, and one-sided jet. With respect to its 4.1 mJy peak feature at 43.2 GHz, we have achieved an astrometric precision at the state-of-art level, about 10 μ as. These results indicate that future VLBI astrometry on OJ 287 could allow us to accurately locate its jet apex and activity boundary, align its restless jet structure over decades without significant systematic bias, and probe various astrophysical scenarios.

Toward micro-arcsecond-precision astrometry on the innermost wobbling jet of OJ 287 at 43 GHz: discovery of a mas-scale compact, stable and 7-arcmin-apart reference source

Preprint

Full-text available

Sep 2023

The BL Lacertae object OJ 287 is a very unusual quasar producing a wobbling radio jet and some double-peaked optical outbursts with a possible period of about 12 yr for more than one century. This variability is widely explained by models of binary supermassive black hole (SMBH) or precessing jet/disk from a single SMBH. To enable an independent and nearly bias-free investigation on these possible scenarios, we explored the feasibility of extremely high-precision differential astrometry on its innermost restless jet at mm-wavelengths. Through re-visiting some existing radio surveys and very long baseline interferometry (VLBI) data at frequencies from 1.4 to 15.4 GHz and performing new Very Long Baseline Array (VLBA) observations at 43.2 GHz, we find that the radio source J0854$+$1959, 7.1 arcmin apart from OJ 287 and no clearly-seen optical and infrared counterparts, could provide a nearly ideal reference point to track the complicated jet activity of OJ 287. The source J0854$+$1959 has a stable GHz-peaked radio spectrum and shows a jet structure consisting of two discrete, mas-scale-compact and steep-spectrum components and showing no proper motion over about 8 yr. The stable VLBI structure can be interpreted by an episodic, optically thin and one-sided jet. With respect to its 4.1-mJy peak feature at 43.2 GHz, we have achieved an astrometric precision at the state-of-art level, about 10 $\mu$as. These results indicate that future VLBI astrometry on OJ 287 could allow us to accurately locate its jet apex and activity boundary, align its restless jet structure over decades without significant systematic bias, and probe various astrophysical scenarios.

Results of KVN Key Science Program for evolved stars

Article

Feb 2024

We present the results of KVN Key Science Program (KSP) for evolved stars, which was launched in 2014. The first phase of KSP ended in June 2020 and the second phase started in October 2020. The goal of KSP is to study the physical characteristics of the evolved stars by observing the spatial distribution and temporal variability of the stellar masers at four frequency-bands (K, Q, W and D bands). The 22 GHz H 2 O maser is usually observed from the outer part of circumstellar envelopes compared to the 43, 86, 129 GHz SiO masers, thus the kinematic links between these regions can be studied by the multi-frequency simultaneous observations of KSP along the stellar pulsation cycles. This eventually enable us to study the enormous mass-loss rate of evolved stars, and the accumulated results from KSP are expected to shed light on the study of the late stage of the stellar evolution.

First Observations With a GNSS Antenna to Radio Telescope Interferometer

Article

Full-text available

Aug 2023
RADIO SCI

We describe the design of a radio interferometer composed of a Global Navigation Satellite Systems (GNSS) antenna and a Very Long Baseline Interferometry radio telescope. Our eventual goal is to use this interferometer for geodetic applications including local tie measurements. The GNSS element of the interferometer uses a unique software‐defined receiving system and modified commercial geodetic‐quality GNSS antenna. We ran three observing sessions in 2022 between a 25 m radio telescope in Fort Davis, Texas (FD‐VLBA), a transportable GNSS antenna placed within 100 m, and a GNSS antenna placed at a distance of about 9 km. We have detected a strong interferometric response with a Signal‐to‐Noise Ratio (SNR) of over 1,000 from Global Positioning System and Galileo satellites. We also observed natural radio sources including Galactic supernova remnants and Active Galactic Nuclei located as far as one gigaparsec, thus extending the range of sources that can be referenced to a GNSS antenna by 18 orders of magnitude. These detections represent the first observations made with a GNSS antenna to radio telescope interferometer. We have developed a novel technique based on a Precise Point Positioning solution of the recorded GNSS signal that allows us to extend integration time at 1.5 GHz to at least 20 min without any noticeable SNR degradation when a rubidium frequency standard is used.

VLBI Astrometry of radio stars to link radio and optical celestial reference frames. I. HD 199178 & AR Lacertae

Article

Apr 2023
MON NOT R ASTRON SOC

To accurately link the radio and optical Celestial Reference Frames (CRFs) at optical bright end, i.e. with GaiaG-band magnitude $\lesssim$13, increasing number and improving sky distribution of radio stars with accurate astrometric parameters from both Very Long Baseline Interferometry (VLBI) and Gaia measurements are mandatory. We selected two radio stars HD 199178 and AR Lacertae as the target for a pilot program for the frame link, using the Very Long Baseline Array at 15 GHz at six epochs spanning about 1 yr, to measure their astrometric parameters. The measured parallax of HD 199178 is 8.949 ± 0.059 mas and the proper motion is μαcos δ = 26.393 ± 0.093 and μδ = −0.950 ± 0.083 mas yr−1, while the parallax of AR Lac is 23.459 ± 0.094 mas and the proper motion is μαcos δ = −51.906 ± 0.138 and μδ = 46.732 ± 0.131 mas yr−1. Our VLBI measured astrometric parameters have accuracies about 4–5 times better than the corresponding historic VLBI measurements and comparable accuracies with those from Gaia, validating the feasibility of frame link using radio stars. With the updated astrometric parameters for these two stars, there is a ∼25 per cent reduction of the uncertainties on the Y-axis for both orientation and spin parameters.

Interferometry and Synthesis in Radio Astronomy

Book

Full-text available

Jan 1991

An overview of the basics of radio astronomy is presented as well as a short history of the development of radio interferometry. The underlying relationships of interferometry are discussed with consideration given to the coordinate systems and parameters that are required to describe synthesis mapping and the configurations of antennas for multielement synthesis arrays. Other topics include the response of the receiving system, digital signal processing, VLBI, calibration and Fourier transformation of visibility data, interferometer techniques for astrometry and geodesy, propagation effects, and radio interference.

not submitted

Article

Sep 2007

Markus Stoffel

Radio interferometry : the saga and the science

Article

Jan 2000

Four-Wave Sum-Mixing with Induced Transparency in Atomic Hydrogen

Article

Jan 1994

Robert Ian Thompson

Pulsed coherent radiation at 102.6 nm was generated via four-wave sum-frequency mixing in atomic hydrogen using electromagnetic-field coupling to produce induced transparency. Radiation at 243 and 657 nm was focused into an effusive beam of atomic hydrogen where resonantly enhanced, nonlinear mixing occurred. The 243 nm (pump) radiation was chosen to be in two-photon resonance with the 2s-1s frequency difference, while the 657 nm (coupling) radiation strongly coupled the 3p and 2s levels resulting in suppression of the linear susceptibility of the system and reduction in absorption of the generated radiation due to induced transparency. Experiments were carried out under low atomic density conditions to study the single-atom response of the generation process. A measure of the induced transparency, given by the linear susceptibility, was obtained from ions generated in the process. Measurements of the VUV power and ion signal were carried out under various conditions of the incident laser fields, and the results were shown to be in general agreement with recent steady-state theories of nonlinear mixing with induced transparency. The application of a dc electric field in excess of {~}4 kV/cm was found to suppress VUV generation, and interfere with induced transparency. Nonlinear generation with low coupling power (

Imaging capability of the Mitaka VSOP correlator

Article

Dec 2000
ADV SPACE RES

We have made a comparison between data between the Mitaka and Penticton correlators, using VSOP observations of NRAO530 at 1.6 GHz. After calibrating the data with AIPS we find excellent agreement with the closure phase between the data processed at the two correlators. The images made from both data sets also agree well. We have made other VSOP images of OQ208 at 1.6 GHz, and 3C395, J2011-15 at 5 GHz from data correlated by the Mitaka correlator. These images are all consistent with ground-only VLBI results at higher frequencies, and demonstrate the capability of the Mitaka correlator as a VSOP correlator.

The quadratic density response

Article

Dec 1999

Martin Rommel

In this thesis we provide analytic expressions for the quadratic density response function, establish a series of sum rules and show how the quadratic response provides an improved dynamical local field correction. The closed forms for the quadratic density-density response function c0 (k1, omega1 k 2, omega2) are calculated for a non-interacting Fermi system at zero temperature in one, two, and three dimensions. In the analysis of the result we focus on the two-dimensional case which exhibits a number of interesting features. We find that c0 (k1, omega1 k 2, omega2) vanishes not only in the static long wavelength limit as predicted by the quadratic compressibility sum rule but that there is an adjacent kl- omega l-region where c0 (k1, omega1 k 2, omega2) is purely imaginary. Depending on the angle between k1 and k2 the onset of the real part of c0 can be discontinuous. This and other discontinuities in the real part which all are accompanied by logarithmic singularities in the imaginary part are scrutinized. A systematic series of frequency moments for the quadratic density response and conservation sum rules for the quadratic dynamic structure function is established. Similarly to the linear case the lower order sum rules are exhausted by the RPA structure, but correlational effects govern higher order sum rules. We demonstrate that the correlational correction to the linear response function represents by the dynamical local field correction G( k, omega) is intimately related to the quadratic density response function through the velocity-average approximation. This approach allows a non-perturbative calculation of G(k, omega) which is superior to the widely used STLS.

Very-Long-Baseline Radio Interferometry: The Mark III System for Geodesy, Astrometry, and Aperture Synthesis

Article

Jan 1983
SCIENCE

The Mark III very-long-baseline interferometry (VLBI) system allows recording and later processing of up to 112 megabits per second from each radio telescope of an interferometer array. For astrometric and geodetic measurements, signals from two radio-frequency bands (2.2 to 2.3 and 8.2 to 8.6 gigahertz) are sampled and recorded simultaneously at all antenna sites. From these dual-band recordings the relative group delays of signals arriving at each pair of sites can be corrected for the contributions due to the ionosphere. For many radio sources for which the signals are sufficiently intense, these group delays can be determined with uncertainties under 50 picoseconds. Relative positions of widely separated antennas and celestial coordinates of radio sources have been determined from such measurements with 1 standard deviation uncertainties of about 5 centimeters and 3 milliseconds of arc, respectively. Sample results are given for the lengths of baselines between three antennas in the United States and three in Europe as well as for the arc lengths between the positions of six extragalactic radio sources. There is no significant evidence of change in any of these quantities. For mapping the brightness distribution of such compact radio sources, signals of a given polarization, or of pairs of orthogonal polarizations, can be recorded in up to 28 contiguous bands each nearly 2 megahertz wide. The ability to record large bandwidths and to link together many large radio telescopes allows detection and study of compact sources with flux densities under 1 millijansky.

Precision Timing at the Parkes 64-m Radio Telescope

Article

Jan 2003

M. Bailes

The Mark IV VLBI Data-Acquisition and Correlation System

Article

Jan 1993
Proc Int Astron Union

A. R. Whitney

Modern VLBI observations for both astronomy and geodesy continue to demand the utmost in sensitivity. Of the methods potentially available for increasing the sensitivity of continuum VLBI observations, increasing the recorded bandwidth is generally the most cost effective. Over the past two years a broadly-supported program has been underway at Haystack Observatory to increase the sensitivity of the Mark IIIA VLBI system by more than a factor of 2. The result is an upgrade to the existing Mark IIIA data-acquisition system, dubbed Mark IV, which increases the maximum data rate to 1024 Mbits/sec, more than quadrupling the maximum data-rate of the Mark IIIA.. A new correlator, based on a new custom VLSI correlator chip is also being designed to support the 1 Gbit/sec data rates from the Mark IV data-acquisition-system. An international collaborative effort is being mounted to help defray the high costs of development.

Fundamentals of Radio Interferometry

Article

Jan 1999

A. Richard Thompson

The practical aspects of interferometry are reviewed, starting with a two element interferometer.

DiFX: A Software Correlator for Very Long Baseline Interferometry Using Multiprocessor Computing Environments

Abstract and Figures

Recommended publications

Into the central 10 pc of the most distant known radio quasar. VLBI imaging observations of J1429+54...

High Precision Astrometric Millimeter VLBI Using a New Method for Atmospheric Calibration

Radio interferometry: Theory, techniques, and applications; Proceedings of the 131st IAU Colloquium,...

Very long baseline interferometry and geodetic applications