Copyright 2014 Society of Photo-Optical Instrumentation Engineers. This paper was published in Proceedings of SPIE Vol. 9290, 929023, DOI: http://dx.doi.org/10.1117/12.2073377 and is made available as an electronic reprint (preprint) with permission of SPIE. One print or electronic copy may be made for personal use only. Systematic or multiple reproduction, distribution to multiple locations via electronic or other means, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper are prohibited.

# **Data Processing Board Design for CBM Experiment**

Wojciech M. Zabołotny<sup>a</sup> and Grzegorz Kasprowicz<sup>a</sup>

<sup>*a*</sup>Institute of Electronic Systems, Warsaw University of Technology, ul. Nowowiejska 15/19, 00-665 Warszawa, Poland

#### ABSTRACT

This paper presents a concept of the Data Processing Boards for the Compressed Baryonic Matter (CBM) experiment. Described is the evolution of the concepts leading from the functional requirements of the control and readout systems of the CBM experiment to the design of prototype implementation of the DPB boards. The paper describes requirements on the board level and on the crate level. Finally it discusses the prototype design prepared for testing and verification of proposed solutions, and selection of the final implementation.

**Keywords:** Compressed Baryonic Matter, CBM, Readout, Front End Electronics, First Level Event Selector, Data Processing Boards, DPB, FPGA, MTCA.4, Etherbone

## **1. INTRODUCTION**

The Compressed Baryonic Matter (CBM) experiment<sup>1</sup> at the FAIR facility in Darmstadt (Germany) is a new experiment, aimed on exploration of the QCD phase diagram in the region of high baryon densities during high-energy nucleus-nucleus collisions.<sup>2</sup> The CBM experiments will utilize multiple detectors including the Silicon Tracker (STS),<sup>3</sup> Muon Chamber (MUCH),<sup>4</sup> Transition Radiation Detector (TRD),<sup>5</sup> Micro Vertex Detector (MVD), Time of Flight Detector (TOF)<sup>6</sup> and Ring Imaging Cherenkov detector (RICH).<sup>7</sup> Each detector is connected to the Front End Electronic (FEE) boards, which provide it with necessary communication functionalities:

- Reception of the master clock and the timing information from the Timing and Flow Control (TFC) system.
- Transmission of the Flow Control related status to the TFC system.
- Reception of the synchronous commands from the TFC system.
- Reception of the setup and control commands from the Experiment Control System (ECS) and transmission of responses to those commands
- Transmission of received data to the First Level Event Selector (FLES),<sup>8</sup> which is the first stage of the data acquisition system.

The Front End Electronic boards are located near to detectors in an irradiated area. In the currently proposed setup, in most CBM detectors, the front end boards will be equipped with so called e-port<sup>9</sup> interfaces, and will be connected to the Readout Boards (ROB), which uses the radiation hard GBTX ASIC.<sup>10</sup> The GBT<sup>11</sup> technology was developed in CERN for upgrade of the LHC based experiments.<sup>12</sup> The GBTX chip uses 4.8 Gb/s bidirectional optical link for communication with TFC, ECS and FLES systems located outside the irradiated area. At the the FEE side it allows to establish deterministic latency bidirectional communication with the e-ports connected via SPI-like e-links,<sup>9</sup> transmitting up to 120 bits of information every 25 ns.

Transmission of the acquired data from ROBs via 4.8 Gb/s links to the FLES (over the straight view distance of 350 m, while the length of the links due to limitations to fiber placement will be probably significantly higher) would be unjustified from the cost point of view.

As a solution, the Data Processing Boards (DPB) have been proposed, which are an additional layer, located near to the FEE boards, but outside the irradiated area. Due to this location the DPB layer may be based on standard, more complex electronic systems using the COTS FPGA or even embedded systems and faster optical links. The position of the proposed DPB layer in the CBM control and readout system is shown in Figure 1.

Further author information: (Send correspondence to W.M.Z.) W.M.Z.: E-mail: wzab@ise.pw.edu.pl, Telephone: +48 22 234 7717



Figure 1. Position of the DPB layer in the CBM detector control and readout chain (based on a presentation prepared by Dr Walter Müller).

## 2. REQUIREMENTS FOR THE DPB LAYER

The previously described role of the DPB boards in the CBM experiment results in a series of requirements.

#### 2.1 Reduction of links number

The main reason to use DPB is to allow aggregation of the acquired data, to reduce the number of optical links. To achieve that goal we must use the FLES links with higher speed than the FEE links. The concentration coefficient achievable in that way is simply the ratio between the link speeds, so the speed of the FLES links should be as high as possible. Unfortunately the cost of optical transceivers significantly increases with the link speed. Therefore the 10 Gb/s FLES links seem to be a reasonable choice, allowing to reduce the number of links by factor of 2.

Further reduction of the number of links may be achievable by reduction of amount of transmitted information. The information transmitted by FEE boards contains additional information (e.g. time markers, source identifiers etc.). When this information is aggregated and encapsulated in the format required by FLES, some of this fields become redundant and may be omitted.

The next method to reduce the amount of information which must be sent to the FLES, is to perform local processing of the data provided by the detectors, and to extract and transmit only important features. However applicability of that method is highly detector dependent. Therefore the DPB must be flexible enough to allow implementation of different data processing algorithms.

## 2.2 Transmission of the control commands between the ECS and FEE

The natural method to provide the communication between the ECS and the DPB layer is the computer network (e.g. the Ethernet based one). Therefore the DPB layer must provide translation between the network protocols and simple register access commands, which may be interpreted and performed by the FEE.



Figure 2. Functional blocks to be implemented in the DPB FPGA.

#### 2.3 Transmission of the clock, timing and synchronous commands from TFC to FEE

The FEE layer must receive the master clock together with the synchronization information from DPB layer via communication link. It enforces use of deterministic latency communication protocol. It also requires that the clock used to drive the transmitters for FEE links must be recovered from the receiver clock from the TFC link. The DPB layer must also produce the "busy" status for the TFC system. To accomplish that each DPB board must monitor the occupancy of the connected FEE boards.

### **3. IMPLEMENTATION OF THE DPB LAYER**

As the main function of the DPB is reception of data from multiple fast optical links, and transmission of processed data through another set of fast optical links, the natural platform for DPB implementation is the FPGA chip with multiple gigabit transceivers. Fortunately contemporary FPGA vendors offer reasonable number of gigabit transceivers even in FPGA chips with moderate prices. Good examples may be the Xilinx Series 7,<sup>13</sup> especially the Kintex-7 family.<sup>14,15</sup>

The layout of the internal logic to be implemented in the FPGA is shown in Figure 2

#### 3.1 Implementation of the FEE communication block

The most difficult problem in implementation of the part of FPGA firmware related to the FEE communication was finding the IP cores able to establish deterministic latency communication with the FEE.

The first proposed solution was the CBMnet protocol and IP core<sup>16,17</sup> developed especially for that purpose for CBM.

However the later decision of the CBM collaboration to use the GBT technology (already described in the Introduction) allowed to use the GBT-FPGA<sup>18</sup> IP core dedicated for communication with the GBTX ASIC. The GBT-FPGA core is prepared for implementation in most modern FPGAs equipped with high speed gigabit transceivers (e.g. Xilinx - Virtex 6 & 7



Figure 3. Usage of the GBT-FPGA in the autonomously working DPB board, with shown system for MGT clock recovery and cleaning. The most complex, GBT-FPGA based implementation of TFC system assumed.

and Kintex 7 or Altera - Cyclone V and Stratix V) The GBT-FPGA core may be configured to provide deterministic latency communication in both directions (deterministic latency may be selected for transmitter and/or receiver independently).

As it is mentioned in the Introduction, the GBT-FPGA core itself does not implement any special protocol for communication with FEE chips. Therefore it is necessary to develop such a protocol for particular subdetectors. Example of such protocol developed for the STS subdetector is described in Ref. 19.

#### 3.2 Implementation of the TFC communication block

The timing and synchronization information may be transmitted from the TFC do DPB layer, using different technologies. One of them is the White Rabbit<sup>20</sup> technology. In such a case, the 125 MHz master clock will be distributed, together with the "Pulse Per Second" (PPS) signal. This 125 MHz clock must be later on converted to the 120 MHz clock, needed as the reference clock for GBT-FPGA links. This conversion may be achieved by conversion of the 125 MHz clock first to 5 MHz, and then to 120 MHz. The advantage of this solution is possibility to use low cost Ethernet equipment and cables. The synchronous commands in this solution should be sent in advance using the White Rabbit "critical data" technology, and scheduled for transmission to the FEE in the particular clock pulse.

Another solution, may be usage of GBT-FPGA cores to distribute clock and timing and to send synchronous commands. In this solution the master clock distributed by the TFC is available from the GBT-FPGA receiver, as the RX\_FRAMECLK with frequency of 40 MHz. Additionally the GBT-FPGA receiver delivers the RX\_WORDCLK clocks with frequency of 120 MHz, which after the jitter cleaning may be used as the reference clock (MGT\_REFCLK) for GBT-FPGA links communicating with the FEE boards (see Figure 3).

The jitter cleaner system may be implemented in two different ways:

- With the standard PLL clock distribution IC with external VCXO. The clock recovered from GTX receiver (RX\_FRAMECLK) is fed via a general purpose IO to the reference input of the PLL circuit. The PLL output provides the jitter-cleaned clock which may be used as GTX reference. The disadvantage of this method is introduction of additional jitter during routing the recovered clock to the external pin of FPGA.
- With the hybrid PLL circuit, which consists of external VCXO tuned by serial DAC, controlled by digital filter and DDMTD (Digital Dual Mixer Time Difference) phase detector.<sup>21</sup> The phase detector compares recovered GTX receiver clock and VCXO clock. The error signal tunes the VCXO to keep the phase difference constant. Such approach is used in White Rabbit time synchronization system, deployed successfully in several applications using a few FPGA platforms.<sup>20</sup>

In this solution, transmission of synchronous commands with deterministic latency is provided by the GBT-FPGA core. The significant disadvantage of this approach is use of costly optical components. Additionally this solution provides very high communication bandwidth which will be not fully utilized.

Yet another approach may be to use the hierarchically organized TFC system in which the DPB boards will be connected via relatively short dedicated copper links to the TFC master. In such solution both master clock, the PPS and synchronous commands may be sent using the bi-phase or Manchester encoding. However the jitter cleaner will also be needed, as encoding introduces unacceptable jitter into the received clock.

## 3.3 Implementation of the FLES link

Transmission of the aggregated and preprocessed data to the FLES will be performed via GTX transceivers. As it is stated in Section 2.1, 10 Gb/s links should be used for that purpose. The FLES group has developed the FLES Interface Board (FLIB) and is developing the associated "FLES Detector Input Interface" IP core.

Another interesting option may be to use the 10GBASE-R Ethernet,<sup>22</sup> which allows to use the standard Network Interface Card (NIC) in the FLES. It could be tempting to use the standard TCP/IP protocol to transmit acquired data to the FLES, however implementation of the standard TCP/IP stack working efficiently with 10 Gb/s link in the FPGA consumes a lot of resources. There exists a lightweight implementation of reduced TCP/IP protocol<sup>23</sup> developed for the CMS experiment Data Acquisition System, however sources of this solution are not widely available. Another possibility is to develop a dedicated network protocol, oriented on reliable transfer of data from FPGA to to FLES. The early version of such protocol for reliable transmission of measurement data from the FPGA with small resources consumption has been developed and is publicly available.<sup>24</sup>

The disadvantage of using the 10GBASE-R Ethernet technology is the fact, that it requires bidirectional links. In the FLES link the bandwidth needed for transmission of data from DPB to FLES will be much higher, than the bandwidth needed to transmit acknowledgements from FLES to DPB. If a protocol working with unidirectional links is used (e.g. the Xilinx Aurora 64B/66B<sup>25</sup>), then a single link from FLES to DPB may transmit acknowledgements for a few data links from DPB to FLES. Assuming, that a single DPB board supports 4 FLES links, implementation of the Ethernet base solution would require 8 fibers (4 in each direction), while implementation based on unidirectional links would require only 5 fibers (4 from DPB to FLES, and 1 from FLES to DPB).

#### 3.4 Implementation of the data processing module

As it has been described in section 2.1, the data processing will heavily depend on the detector to which the particular DPB will be connected. For some detectors local processing of data and extraction of significant features can be done, leading to significant reduction of data, while for other detectors, the only processing may be associated with the time sorting of incoming data and encapsulation in the FLES requested "microslice"<sup>26</sup> format. Therefore on the current stage, the DPB firmware will only implement the framework for the detector specific data processing. The proposed framework will contain services such as:

- Reception and unpacking of data from the input GBT links
- Time sorting of data (the versatile sorter has been developed<sup>27</sup> has been developed, but in many cases a simplified methods, based on the knowledge about possible data records ordering, may be applied)

- Encapsulation of the processed data in the microslice containers
- Sending of the data to the output link

An example of the subdetector requiring significant data processing in the DPB layer is the TRD detector,<sup>28</sup> using the SPADIC ASIC.<sup>29</sup>

#### 3.5 Implementation of the slow control module

The slow control will be associated with the access to the internal registers of the DPB FPGA. The standard open and extensible solution for communication with subsystems implemented in FPGAs is the Wishbone bus.<sup>30,31</sup> Because the preferred option to implement the slow control is a standard computer network, we can use an extension allowing to control the Wishbone bus remotely, via Ethernet network – the Etherbone core.<sup>32–34</sup>

## 4. SELECTION OF THE CRATE TECHNOLOGY FOR DPB BOARD

Even though possibility to have a single, autonomously working DPB board is highly desirable during the development, in the final system the DPB boards should be placed in a crate allowing to provide the mechanical support, power supply and connectivity. Additionally organizing the DPB boards in crates allows to avoid implementing of some necessary services in each board individually, as it is described later.

For many years the standard crate solution for the HEP measurement systems was VME,<sup>35</sup> however this technology, is generally outdated and not recommended for new designs. The most powerful crate technology for the measurement and data processing systems is probably the ATCA (Advanced Telecommunications Computing Architecture).<sup>36</sup> Unfortunately this technology is too expensive for our purpose. The cheaper solution widely used in different HEP experiments is the MictroTCA (MTCA),<sup>37,38</sup> especially in the version MTCA.4 (MicroTCA for Physics).<sup>39,40</sup> Therefore finally the MTCA.4 crate was selected for implementation of the DPB infrastructure.

## 4.1 Communication between the TFC system and the DPB crate

The DPB boards will be built as the Advanced Mezzanine Cards (AMC).<sup>41</sup> The DPB crate may contain up to 12 AMCs. To simplify the design, only one of those AMCs will receive the clock, timing and synchronous commands from the TFC system, and send the "busy" status to the TFC (it will be the "TFC slave" board). Preferably this board should also work as a standard DPB board, however it depends on a final choice of the TFC technology. If a solution consuming significant amount of FPGA resources will be chosen, the "TFC slave" will offer only a limited DPB functionality.

The master clock and PPS signal will be distributed by the MTCA.4 backplane using the MCH crossbar. The 120 MHz "Master clock" may be distributed as TCLKA, and the PPS may be distributed as TCLKB respectively. Another TFC related signals may be distributed via 8 M-LVDS lines available as ports 17-20 in the MTCA.4 backplane.

To implement the Flow Control, it is necessary, that the DPB board may signal the "busy" state. In fact it is important to state in the real time, that at least on of the DPB boards is "busy" (more detailed check of their statuses may be done via the slow control interface). This allows to collect the "busy" information using the single M-LVDS line, connected to the M-LVDS type 2 receiver located in the board implementing the "TFC slave" block.

The remaining 7 M-LVDS lines may be used to transmit synchronous commands.

## 4.2 Implementation of the slow control in the DPB crate

The MicroTCA Carrier Hub (MCH) which is the part of the MTCA.4 crate distributes the 1 Gb/s Ethernet to all AMCs board. This allows using it for slow control via the Etherbone core, as described in section 3.5. The disadvantage of this approach is consumption of one GTX transceiver in each DPB card. There are several options that save the GTX resources:

• Usage of MLVDS links. Each AMC board has unique geographical address that can be used to implement simple communication protocol. The board with "TFC slave" block then could be used as a communication bridge between Ethernet and MLVDS. Simple serial protocol could be implemented using even single MLVDS lane. Since the MLVDS can work easily at 80 MHz, achievable data rate seems to be sufficient for slow control.



Figure 4. The prototype DPB board in the MTCA.4 crate with example of signals' routing.

- One can add external SER-DES chip (i.e. TLK3131) connected to PORT0/1 from one side and to some IOs of FPGA on the other side. Then, Etherbone IP core can talk directly with the SERDES using RGMII port.
- Another option is to use RTM connectors to implement point-to-point solution based on LVDS IOs of FPGA. This would require dedicated cabling and custom RTM boards but gives great flexibility. In the case where RTMs are used as holders for optical transceivers, such solution would not add significant cost to the system. In this concept, The board with "TFC slave" block would receive commands over 1 Gb/s Ethernet and would have to implement a bridge between the Etherbone and e.g. the Wishbone Serializer<sup>42</sup> cores. This allows control of the DPB cards via high speed serial connection using standard SERDES<sup>43</sup> blocks available in the Xilinx 7 Series FPGAs.

## 4.3 Implementation of the FEE and FLES connectivity in the DPB crate

There are several connectivity options for FEE and FLES. FLES links are based on single-mode optical transceivers that require 10Gbit/s bandwidth while FEE require multi-mode links running at 4.8Gbit/s. All these requirement can be covered by SFP or SFP+ optical transceivers respectively, but they are bulky and expensive. Moreover, providing that in the final solution we need more than 16 links, they will not fit to standard AMC panel, even with RTM extension card. Alternatively, QSFP transceivers can be used, they would give us up to 24 optical links per single AMC module (see Figure 5). Another, optional 24 links can be placed on the RTM board. Usage of the RTM board adds flexibility to the design since such board is relatively simple and low cost compared to complex and expensive FPGA AMC board. One can produce standard AMC boards with some common connectivity, and series of RTM boards with various optical links, for example mix of QSFPs, SFP+ and MiniPod<sup>44</sup> transceivers. Such a solution makes servicing easier and lowers the cost by reducing number of different expensive AMC boards. Another option is to use FPGA Mezzanine Cards (FMC) which would host optical transceivers. But they are much more constrained due to small size of the board. Single FMC can hold up to 4 SFP+ or 2 QSFP modules. Up to 2 FMCs can be used on single AMC.



Figure 5. Drawing of AMC board with 24 optical links using QSFP transceivers



Figure 6. Drawing of the AMC FMC Carrier with Kintex 7 FPGA (AFCK).

## 5. PROTOTYPE OF THE DPB BOARD

The hardware implementation of the DPB board has been designed, and is available in the Open Hardware repository, as the AMC FMC Carrier Kintex (AFCK)<sup>45</sup> project. It is a versatile board (see Figure 6) with several extension options thanks to 2 FMC sockets and RTM connector. All 16 GTX transceivers can be connected to FMC, AMC or RTM connectors by placing optional 0201 capacitors. The board can work stand-alone, without the MTCA crate, with only single +12V power supply required. It implements 8 MLVDS links that can be used as communication channel. If additional gigabit transceiver is required, it can be placed on FMC board, together with optical modules and connected to Port0/1 of the AMC connector using UFL jumpers. The board also has large (2GB), fast DDR3 memory that can be used to fully buffer two or more 10Gbit data streams. Versatile clock distribution circuit, based on 16x16 crossbar, together with numerous VCOs and VCXOs enables implementation of very complex clocking schemes. The AFCK can be also used to implement the "TFC slave" without hardware modifications.

#### 5.1 Flexibility of the proposed design

The proposed implementation of the DPB prototype allows for further studies on optimal implementation. AFCK is a perfect candidate for the DPB prototype because optical connectivity is provided using low cost FMC boards allowing examination of several options. Use of AFCK board does not require investments in MTCA infrastructure (crate, controller, MCH, PSU). Changes of selected method for slow control communication does not require modification of the hardware part, since the board implements rich connectivity and clocking options.

## 6. CONCLUSIONS

The proposed design of the Data Processing Board prototype provides a hardware platform, allowing verification and testing of different solutions for required functionalities. The DPB prototypes may be used both standalone and in the MTCA crate. Placement of optical transceivers on simple FMC boards or on RTM modules allows to change the optical technology (MicroPOD, SFP+, QSFP) without designing of the new FPGA board. Final DPB solution may be based on the same concept as AFCK. Probably the only modification may be the replacement of FPGA with the next generation chip with increased number of gigabit transceivers.

## ACKNOWLEDGMENTS

Authors would like to thank colleagues from the CBM FLES-DAQ group, especially Dr Walter Mueller from GSI, Dr Krzysztof Kasinski and Prof. Robert Szczygiel from AGH for their work, for sharing ideas and discussions which were basis for the concepts described in this paper.

#### REFERENCES

- Friman, B., Höhne, C., Knoll, J., Leupold, S., Randrup, J., Rapp, R., and Senger, P., [*The CBM physics book : compressed baryonic matter in laboratory experiments*], vol. 814 of *Lecture Notes in Physics*, Springer, Berlin [u.a.] (2011). also as eBook (ebook ISBN: 978-3-642-13293-3).
- [2] "CBM The Compressed Baryonic Matter experiment." http://www.fair-center.eu/for-users/ experiments/cbm.html. [Online; accessed 17-June-2014].
- [3] Heuser, J., Müller, W., Pugatch, V., Senger, P., Schmidt, C. J., Sturm, C., and Frankenfeld, U., [*Technical Design Report for the CBM Silicon Tracking System (STS)*], no. GSI Report 2013-4, GSI, Darmstadt (2013). Also available at: http://repository.gsi.de/record/54798.
- [4] Biswas, S., Schmidt, D. J., Abuhoza, A., Frankenfeld, U., Garabatos, C., Hehner, J., Kleipa, V., Morhardt, T., Schmidt, C. J., Schmidt, H. R., and Wiechula, J., "Development of a GEM based detector for the CBM Muon Chamber (MUCH)," *Journal of Instrumentation* 8(12), C12002 (2013).
- [5] Arend, A. and the Cbm Trd Group, "Status and first results of the CBM TRD prototype development," *Journal of Physics: Conference Series* **426**(1), 012022 (2013).
- [6] Deppner, I., Herrmann, N., Gonzalez-Diaz, D., et al., "The CBM time-of-flight wall," *Nuclear Instruments and Methods in Physics Research, Section A: Accelerators, Spectrometers, Detectors and Associated Equipment* 661(SUPPL. 1), S121–S124 (2012).
- [7] Lebedev, S., Höhne, C., Ososkov, G., and the Cbm Collaboration, "Ring recognition and electron identification in the rich detector of the CBM experiment at FAIR," *Journal of Physics: Conference Series* **219**(3), 032015 (2010).
- [8] de Cuveland, J., Lindenstruth, V., and the CBM Collaboration, "A first-level event selector for the CBM experiment at FAIR," *Journal of Physics: Conference Series* **331**(2), 022006 (2011).
- [9] "e-link IP for FE ASICs." http://indico.cern.ch/event/145696/contribution/3/material/slides/1. pdf. [Online; accessed 17-June-2014].
- [10] Moreira, P., Baron, S., Bonacini, S., Cobanoglu, O., Faccio, F., Feger, S., Francisco, R., Gui, P., Li, J., Marchioro, A., Paillard, C., Porret, D., and Wyllie, K., "The GBT-SerDes ASIC prototype," *Journal of Instrumentation* 5(11), C11022 (2010). Also available at: http://iopscience.iop.org/1748-0221/5/11/C11022/.
- [11] Moreira, P., Ballabriga, R., Baron, S., et al., "The GBT Project," *Proceedings of the Topical Workshop on Electronics for Particle Physics*, 342–346 (2009). Also available at: http://cds.cern.ch/record/1235836.
- [12] Alessio, F. and Jacobsson, R., "A new readout control system for the LHCb upgrade at CERN," *Journal of Instrumentation* **7**(11) (2012).

- [13] "7 Series FPGAs Overview." http://www.xilinx.com/support/documentation/data\_sheets/ds180\_ 7Series\_Overview.pdf (February 2014). [Online; accessed 17-June-2014].
- [14] "Kintex-7 FPGA Family." http://www.xilinx.com/products/silicon-devices/fpga/kintex-7/ (2014). [Online; accessed 17-June-2014].
- [15] "7 Series FPGAs GTX/GTH Transceivers." http://www.xilinx.com/support/documentation/user\_guides/ ug476\_7Series\_Transceivers.pdf (February 2014). [Online; accessed 17-June-2014].
- [16] Lemke, F., Slogsnat, D., Burkhardt, N., and Bruening, U., "A unified DAQ interconnection network with precise time synchronization," *IEEE Transactions on Nuclear Science* 57(2 PART 1), 412–418 (2010).
- [17] Lemke, F. and Bruening, U., "A hierarchical synchronized data acquisition network for CBM," *IEEE Transactions* on Nuclear Science **60**(5), 3654–3660 (2013).
- [18] Baron, S., Cachemiche, J., Marin, F., Moreira, P., and Soos, C., "Implementing the GBT data transmission protocol in FPGAs," *Proceedings of the Topical Workshop on Electronics for Particle Physics, TWEPP 2009*, 631–635 (2009).
- [19] Kasinski, K., Zabolotny, W. M., and Szczygiel, R., "Interface and protocol development for STS read-out ASIC in the CBM experiment at FAIR." This Volume.
- [20] "The White Rabbit Project ." http://www.ohwr.org/attachments/2528/IBIC2013\_WR.pdf (2013). [Online; accessed 16-July-2014].
- [21] "Digital femtosecond time difference circuit for CERN's timing system ." http://www.ee.ucl.ac.uk/lcs/ previous/LCS2011/LCS1136.pdf (2011). [Online; accessed 16-July-2014].
- [22] "IEEE standard for ethernet section 4," IEEE Std 802.3-2012 (Revision to IEEE Std 802.3-2008), 1-0 (Dec 2012).
- [23] Bauer, G., Bawej, T., Behrens, U., Branson, J., et al., "10 Gbps TCP/IP streams from the FPGA for the CMS DAQ eventbuilder network," *Journal of Instrumentation* **8**(12), C12039 (2013).
- [24] Zabołotny, W. M., "Optimized ethernet transmission of acquired data from FPGA to embedded system," *Proc. SPIE* **8903**, 89031L–89031L–12 (2013).
- [25] "LogiCORE IP Aurora 64B/66B v9.2 Product Guide." http://www.xilinx.com/support/documentation/ip\_ documentation/aurora\_64b66b/v9\_2/pg074-aurora-64b66b.pdf (June 2014). [Online; accessed 15-July-2014].
- [26] Hutter, D., de Cuveland, J., and V., L., "CBM Readout and Online Processing Overview and Recent Developments." https://www-alt.gsi.de/documents/DOC-2013-Apr-27-1.pdf (Mar 2013). [Online; accessed 17-June-2014].
- [27] Zabołotny, W. M., "Dual port memory based heapsort implementation for FPGA," *Proc. SPIE* 8008, 80080E-80080E-9 (2011).
- [28] Garcia, C., Bergmann, C., Emschermann, D., Krieger, M., and Kebschull, U., "Beamtest results of the CBM-TRD feature extraction using SPADIC v1.0." http://repository.gsi.de/record/51968/files/PHN-NQM-EXP-34. pdf. [Online; accessed 17-June-2014].
- [29] Armbruster, T., Fischer, P., and Peric, I., "SPADIC a self-triggered pulse amplification and digitization ASIC," in [Nuclear Science Symposium Conference Record (NSS/MIC), 2010 IEEE], 1358–1362 (Oct 2010).
- [30] "SoCInterconnection: Wishbone." http://opencores.org/opencores.wishbone. [Online; accessed 17-June-2014].
- [31] "WISHBONE System-on-Chip (SoC) Interconnection Architecture for Portable IP Cores." http://cdn. opencores.org/downloads/wbspec\_b4.pdf. [Online; accessed 17-June-2014].
- [32] "EtherBone Core." http://www.ohwr.org/projects/etherbone-core. [Online; accessed 17-June-2014].
- [33] Kreider, M., Bär, R., Beck, D., Terpstra, W., Davies, J., Grout, V., Lewis, J., Serrano, J., and Wlostowski, T., "Open borders for system-on-a-chip buses: A wire format for connecting large physics controls," *Physical Review Special Topics Accelerators and Beams* **15**(8) (2012).
- [34] Kreider, M., Terpstra, W., Lewis, J., Włostowski, T., and Serrano, J., "EtherBone A Network Layer for the Wishbone SoC Bus," *Proceedings of ICALEPCS* C111010, WEBHMULT03. 4 p (2011). Also available at:http://accelconf.web.cern.ch/accelconf/icalepcs2011/papers/webhmult03.pdf.
- [35] "VME Technology FAQ." http://www.vita.com/home/Learn/vmefaq/vmefaq.html. [Online; accessed 17-June-2014].
- [36] "AdvancedTCA PICMG 3.0 Short Form Specification." http://www.picmg.org/pdf/PICMG\_3\_0\_Shortform. pdf. [Online; accessed 17-June-2014].

- [37] "Introduction to MTCA." http://www.picmg.org/pdf/introduction\_to\_microtca.pdf. [Online; accessed 17-June-2014].
- [38] "MTCA specification." https://www.picmg.org/pdf/MicroTCA\_Short\_Form\_Sept\_2006.pdf. [Online; accessed 17-June-2014].
- [39] "MTCA.4 at a Glance." http://tesla.desy.de/doocs/MTCA.4.html. [Online; accessed 17-June-2014].
- [40] Schlarb, H., Walter, T., Rehlich, K., and Ludwig, F., "Novel crate standard MTCA.4 for industry and research," *IPAC 2013: Proceedings of the 4th International Particle Accelerator Conference*, 3633–3635 (2013). Also available at: http://accelconf.web.cern.ch/accelconf/IPAC2013/papers/thpwa003.pdf.
- [41] "PICMC AMC.0 R2.0 Short Form Specification." http://www.picmg.org/pdf/AMC.0\_R2.0\_Short\_Form.pdf. [Online; accessed 17-June-2014].
- [42] "Wishbone Serializer Core." http://www.ohwr.org/projects/wb-serializer-core/wiki. [Online; accessed 17-June-2014].
- [43] "7 Series FPGAs SelectIO Resources." http://www.xilinx.com/support/documentation/user\_guides/ ug471\_7Series\_SelectIO.pdf. [Online; accessed 17-June-2014].
- [44] "MicroPOD<sup>TM</sup> and MiniPOD<sup>TM</sup> 120G Transmitters/Receivers." http://www.avagotech.com/pages/minipod\_ micropod. [Online; accessed 7-July-2014].
- [45] "AMC FMC Carrier Kintex (AFCK)." http://www.ohwr.org/projects/afck. [Online; accessed 17-June-2014].