[Openinfralabs] Project Caerus Update
mhild at redhat.com
Thu Aug 5 13:03:40 UTC 2021
Thanks for the update.
Have you considered deploying a running version of your POC to
the Operate First Community Cloud at https://www.operate-first.cloud/ ?
A Spark cluster and Jupyter notebooks are also available there.
If the community could try out your work, that might increase awareness.
On Wed, Aug 4, 2021 at 5:02 AM Hui Lei <dr.huilei at gmail.com> wrote:
> Dear all,
> I would like to take this opportunity to give you another update on
> Project Caerus. As you may remember, the project develops techniques such
> as near-data processing and semantic caching to optimize the performance of
> disaggregated data lakes. On the near-data processing front, we have
> implemented the pushdown of a wide range of SQL operators from a Spark
> cluster to a storage cluster that deploys either HDFS (CSV format) or S3.
> Our evaluation using TPC-H has shown significant improvements in application
> latency, network I/O, and compute-side CPU time. You can check out our design
> and latest evaluation results on GitHub.
> On the semantic caching front, which explores opportunistic caching of a
> variety of data and metadata, we have the core functionality working, with
> a 4x-5x improvement in execution time and CPU time. Again, the design
> and the initial evaluation results are available on GitHub.
> As always, your comments and contributions are welcome.
> - Hui