Researchers should be able to focus on research, not on managing data.
Utilizing cloud services to manage and process time series data is rapidly becoming the go-to data solution. This is not surprising given the cost savings and processing power available in cloud-based platforms.
Code Willing is an independent global provider of quantitative research and trading software specializing in cloud-based technology. We provide data management services to handle ingesting, cross referencing, cleaning and storing data. We pride ourselves on building efficient, clean and complete data solutions that allow our clients to focus on research.
Code Willing makes data easy and usable.
By leveraging the Code Willing platform, processing raw vendor data into a clean, readily accessible and usable format is made easy allowing research staff to focus on the core of their business. These robust services include: cross referencing, data quality and scalable storage.
Aligning time-series data from different sources is a significant challenge. Cross-referencing independent vendor data sets and keeping them all in time synchronous order can be even more of a challenge. We accomplish this by assigning a unique identifier to all listings. This identifier, known as the “Code Willing Stable ID” or SID, is used to track specific assets through time and across multiple data sets. Additionally, we manage daily processing jobs in a very controlled and batch-oriented manner using Code Willing’s proprietary job scheduler HAL. HAL handles job dependencies, market schedules, time zone conversions and provides an extremely formattable alerting system to notify operation teams of all on-going processes.
Data Quality is an integral part of the Code Willing suite. Anomalies in data behavior can be difficult to detect and can be very frustrating throughout the research process. We approach data quality using a two-phased approach. Our rules-based algorithms check against vendor supplied documentation to ensure correct formatting and overall content while our machine-learning/pattern-recognition processes can find deeper, more embedded anomalies. We are making great strides towards even more robust machine learning techniques. In general, our data quality processing detects outliers regardless of formatting and changes as it automatically re-trains over time. By applying both a rules-based and an algorithmic approach to spot anomalies, the data quality program ensures that the data is clean and lowers the error rate as part of our daily data pipeline.
Content Addressable Storage (CAS) is a flexible platform for optimized content storage with customizable, role-based permissions. CAS stores the data securely in your data center or in the cloud so that it is readily accessible and it features a single, consolidated interface to all files, regardless of their physical location. The CAS platform works for anyone who requires local or cloud-based storage of large numbers of files and the ability to configure fine-grained permissions for access to those files.
High-performance data access and data processing are core aspects of the Code Willing platform. Through these services, research staff can focus on research and investments without having to worry about routine data management.
Learn more about our Data Management Services.