The Corvil product has successfully addressed a number of the difficult challenges encountered by organizations attempting to do microsecond precision latency measurement at scale. These include:
Before any latency measurements can be made, the problem of microsecond precision time synchronization over the wide area must be addressed. Traditionally, the answer to this problem requires an external time synchronization signal to be provided to every latency measurement appliance. GPS is the most frequently used external time sync signal. The problem with this approach is that is expensive and very difficult to administer over large-scale deployments. In particular, when a trading infrastructure involves technology and services from multiple parties, it is often impossible to get a homogeneous time synchronization solution deployed.
While CorvilNet supports external clock synchronization from sources such as GPS, it also offers a unique automated time synchronization technology that delivers microsecond accuracy. The Corvil time synchronization technology requires zero external resources and is set up automatically between CorvilNet appliances once latency measurement channels are configured. This technology has been operating successfully in production environments since 2007 and is the time synchronization method of choice with Corvil customers. No other solution on the market offers this capability.
Unfortunately many people are becoming familiar with this problem. A typical deployment model for latency monitoring involves the use of SPAN ports on routers and switches. Of late, the use of active aggregation TAPs has also become popular. The problem with SPAN ports and aggregation TAPS is that they can become temporarily overloaded due to spikes in the traffic known as microbursts. These microbursts cause congestion and distort the latency profile of the underlying traffic before it reaches the latency measurement appliance. As a result the latency management appliance will give a false and inaccurate reading. This is often referred to as the GARBAGE IN = GARBAGE OUT problem.
Within the CorvilNet solution there is a SPAN/TAP Port Congestion Detection feature that automatically detects and alarms on any congestion occurring on the measurement port prior to the data being received. CorvilNet will flag any impacted measurements whose integrity cannot be guaranteed. Assuring the integrity and accuracy of any underlying latency management systems is of paramount importance and something, which is unique to the CorvilNet solution at the present time.
In building a Latency Management System one has to fundamentally decide from the beginning the scope and scale of the target solution. Is it primarily aimed at point deployments (e.g. within a co-lo center) or is it expected to cover enterprise scale deployments (e.g. within and between multiple data centers, co-lo centers and liquidity venues). Many of the solutions on the market use a centralized latency measurement architecture. Therefore, it is necessary to backhaul 100% of all captured packets and associated timestamps over the network to a central data store before time correlation and latency measurements can be made. This is extremely bandwidth and storage intensive and means that these solutions cannot be used over the WAN and often encounter significant scaling issues in enterprise deployments.
CorvilNet uses a distributed latency management architecture. There is no backhaul of packet captures from the measurement appliances to a central data store. Time correlation for latency measurement is achieved using a lightweight distributed protocol, which adds minimum overhead to the system. All measurements and analysis are done in real-time at the point of measurement. CorvilNet does however support centralized management via its Latency Management Center, which provides a single point of monitoring and configuration for all deployed CorvilNet appliances while preserving the scalable distributed nature of the underlying architecture. The distributed architecture approach is the main reason why the Corvil solution has been successful in Enterprise scale deployments.
The CorvilNet solution is the only offering on the market that provides integrated packet capture, network analysis and application analysis (i.e. transaction, message and middleware) in one single platform. Some solutions require a separate vendor (i.e. three vendors) for each of packet capture, network analysis and application analysis functions with no integration between them. For example, with Corvil it is possible to track latency of any FIX order per customer and compare the contributions to overall latency of the FIX gateway versus the network and then export all associated packet captures to an external platform for archiving or further analysis. All of this can be done in a single view with full time synchronization maintained between analytics at each layer. Likewise a user can look at the latency impact of a microburst at the network layer and determine what application transactions, messages, and clients were impacted.
Corvil offers decoder libraries for protocol analysis of the leading exchange and liquidity venue formats. In addition, a Decoder Library SDK is being introduced in June 2010 allowing customers to develop custom and/or proprietary decoders for protocols not currently supported in the Corvil native library.
Corvil introduced the first Inter Party Latency Management Solution for Financial Markets in April 2009. This is called CorvilClear. Since then much discussion, debate and industry activity has happened surrounding Inter Party Latency (IPL) and the associated topic of latency transparency in the trading loop. Following the success of the Corvil initiative, it was proposed that a standards process would be undertaken by the FIX Protocol group. This is known as FIPL (FIX Inter Party Latency). Two models for Inter Party Latency measurement are emerging:
The Corvil solution is unique in its ability to support both models and continues to be the only company that has successfully deployed both solutions in the market place:
In addition Corvil has established CorvilClear partnerships with leading service providers to the Financial Markets. An example of this is Savvis who is offering both a managed and unmanaged CorvilClear service using the P2P IPL Model for its clients. Savvis is hosting CorvilClear publishers at the leading equities, futures and options and FX market centers for US and EU markets. Therefore clients connected to the Savvis network can receive in real-time, microsecond precision latency monitoring, troubleshooting and reporting to and from all of these market centers.
Corvil continues to pioneer the development of IPL solutions and is active in the FIPL working group. Corvil intends to be fully compliant with and interoperable with any industry-recognized standard in this area.
Corvil has a proven track record for continuously adding new analytics to CorvilNet that leverages the company’s strong heritage in mathematics and scientific innovation. Such analytics include:
Corvil has the highest performing latency management appliances on the market. The CNE 5400 (4 x 10Gb/s) is capable of processing over 5 million pps with 1μs granular real-time analysis. The CorvilNet Engines use specialized hardware time-stamping with nanosecond precision and performs all analytics in real-time before moving the packets to disk.
The performance of the CorvilNet engine is assisted through the use of the latest and fastest multi-core Intel processors combined with a multi-threaded 64-bit software architecture used for all CorvilNet applications.
Most other solutions move the packets to disk immediately before analysis is possible. This means that 100% of packets must always be captured and stored before analysis is possible. This often introduces a lag in reported information and makes live reporting very difficult. Any packets lost in the process can invalidate the analysis results.