# general
  • p

    Pierre

    09/11/2025, 10:23 AM
    I wonder if it's not due to "." or ".."
  • c

    criccomini

    09/11/2025, 5:23 PM
    Yea that seems like a . or .. thing...
  • c

    criccomini

    09/11/2025, 5:24 PM
    Maybe OpenDAL returns it but object_store doesn't.. ?
  • p

    Pierre

    09/11/2025, 6:21 PM
    That would make sense
  • c

    criccomini

    09/11/2025, 6:24 PM
    I suppose we can add filtering logic to remove it
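
    [Editor's sketch] If the stray entry really is a "." or ".." path, as guessed above, the filtering could look roughly like the following. This is a minimal sketch over plain string paths; `is_dot_entry` and `filter_dot_entries` are hypothetical names, not part of SlateDB, object_store, or OpenDAL.

```rust
/// Returns true when the final path segment is "." or "..".
/// A trailing '/' (directory-style listing entry) is ignored.
fn is_dot_entry(path: &str) -> bool {
    let last = path
        .trim_end_matches('/')
        .rsplit('/')
        .next()
        .unwrap_or("");
    last == "." || last == ".."
}

/// Drops "." and ".." entries from a listing, keeping the original order.
fn filter_dot_entries<I>(paths: I) -> Vec<String>
where
    I: IntoIterator<Item = String>,
{
    paths.into_iter().filter(|p| !is_dot_entry(p)).collect()
}
```

    Matching only the last segment keeps names such as "..c" or "a.b" intact, since those are legitimate object names.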
  • p

    Pierre

    09/11/2025, 6:26 PM
    Then, I was not able to get proper error messages on misconfiguration, it just returns a generic error message :/
  • p

    Pierre

    09/11/2025, 6:27 PM
    Perhaps due to the object_store integration
  • c

    criccomini

    09/11/2025, 6:32 PM
    Likely.. that's probably a GH issue for OpenDAL
  • c

    criccomini

    09/11/2025, 6:32 PM
    Thank you for being our guinea pig with all this 🙏 😝
  • p

    Pierre

    09/11/2025, 6:36 PM
    Ah no worries, it’s surprisingly non-trivial
  • p

    Pierre

    09/11/2025, 6:37 PM
    Perhaps things are too abstracted there
  • p

    Pierre

    09/11/2025, 6:37 PM
    I never struggled that much to make s3 work properly 😝
  • d

    diptanuc

    09/11/2025, 8:34 PM
    @criccomini Do you have a sense of how vortex and lance are different?
  • c

    criccomini

    09/11/2025, 8:34 PM
    I haven't dug into Vortex in detail yet.
  • c

    criccomini

    09/11/2025, 8:35 PM
    Lance is basically completely flexible.. you can do whatever you want with it. I suspect Vortex is more opinionated, but in a way that makes it much faster for most workloads
  • d

    diptanuc

    09/11/2025, 8:37 PM
    Lance's positioning was - Parquet doesn't work for unstructured data such as embeddings and images
  • d

    diptanuc

    09/11/2025, 8:37 PM
    Vortex seems to be going for the same positioning
  • c

    criccomini

    09/11/2025, 8:37 PM
    Yup
  • c

    criccomini

    09/11/2025, 8:38 PM
    Lance, Vortex, Nimble etc
  • d

    diptanuc

    09/11/2025, 8:38 PM
    We rolled out a file format in tensorlake which is similar - we couldn't use lance because it was very tied to embeddings
  • d

    diptanuc

    09/11/2025, 8:38 PM
    Our use case was storing raw files and serialized python objects
  • d

    diptanuc

    09/11/2025, 8:38 PM
    Considering moving to one of these formats at some point
  • c

    criccomini

    09/11/2025, 8:38 PM
    Yea it would make sense to. Vortex is definitely worth a look for that kind of use case.
  • c

    criccomini

    09/11/2025, 8:39 PM
    It's odd they tied Lance to embeddings. That must be at the API layer. The disk format is pretty straightforward.
  • d

    diptanuc

    09/11/2025, 8:39 PM
    We store the metadata at the front of the file, along with the size of the metadata. After that, making ranged GET calls to fetch many objects in parallel is trivial
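
    [Editor's sketch] A metadata-first layout like the one described can be illustrated as below. Everything specific here is an assumption for illustration, not Tensorlake's actual format: an 8-byte little-endian length prefix, metadata made of (offset, length) pairs of `u64`, and the function names `encode`, `read_u64_le`, and `object_ranges`.

```rust
use std::ops::Range;

/// Assumed layout: [u64 LE metadata length][metadata][object bytes...].
/// Metadata is one (offset, length) pair of u64 LE values per object,
/// with offsets relative to the start of the data section.
fn encode(objects: &[&[u8]]) -> Vec<u8> {
    let mut meta = Vec::new();
    let mut offset = 0u64;
    for obj in objects {
        meta.extend_from_slice(&offset.to_le_bytes());
        meta.extend_from_slice(&(obj.len() as u64).to_le_bytes());
        offset += obj.len() as u64;
    }
    let mut file = Vec::new();
    file.extend_from_slice(&(meta.len() as u64).to_le_bytes());
    file.extend_from_slice(&meta);
    for obj in objects {
        file.extend_from_slice(obj);
    }
    file
}

/// Reads a little-endian u64 from the first 8 bytes; panics on shorter input.
fn read_u64_le(bytes: &[u8]) -> u64 {
    let mut buf = [0u8; 8];
    buf.copy_from_slice(&bytes[..8]);
    u64::from_le_bytes(buf)
}

/// Converts the metadata block into absolute byte ranges within the file.
/// Each range could then back one independent ranged GET.
fn object_ranges(meta: &[u8]) -> Vec<Range<u64>> {
    let data_start = 8 + meta.len() as u64;
    meta.chunks_exact(16)
        .map(|entry| {
            let off = read_u64_le(&entry[..8]);
            let len = read_u64_le(&entry[8..]);
            (data_start + off)..(data_start + off + len)
        })
        .collect()
}
```

    A reader would fetch the first 8 bytes, then the metadata block, and after that every object range is known up front, so the remaining reads do not depend on each other.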
  • d

    diptanuc

    09/11/2025, 8:39 PM
    I imagine the client libraries of these file formats will handle all these for us
  • d

    diptanuc

    09/11/2025, 8:39 PM
    Yeah that's the Rust API layer
  • d

    diptanuc

    09/11/2025, 8:40 PM
    Also a good move from them by giving it to LSF
  • c

    criccomini

    09/12/2025, 1:29 AM
    All, 0.8.2 has been pushed. I goofed and accidentally did it off main rather than the cherry-picked v0.8.2 branch. That said, the only significant change is that the garbage collector is using the MessageDispatcher now. Other than that, the two branches are the same.
  • c

    criccomini

    09/12/2025, 1:29 AM
    I'm going to leave it as-is. LMK if you see any issues 🙂