Data Activation Thoughts

(galsapir.github.io)

21 points | by galsapir 2 days ago ago

5 comments

  • armcat a day ago ago

    I've been working in legaltech space and can definitely echo the sentiments there. There are some major legaltech/legal AI companies but after speaking to dozens of law firms, none of them are finding these tools very valuable. But they have signed contracts with many seats, they are busy people, and tech is not intrinsic to them, so they are not in the business of just changing tools and building things in-house (a handful of them are). And the problem is despite massive amount of internal data, all the solutions fail on the relevance and precision scale. When I sit down with actual legal associates, I can see how immensely complex these workflows are, and to fully utilize this data moat you need: (1) multi-step agentic retrieval, (2) a set of rules/heuristics to ground and steer everything per transaction/case "type", (3) adaptation/fine-tuning towards the "house language/style", (4) integration towards many different data sources and tools; and you need to wrap all this with real-world evals (where LLM-as-a-judge technique often fail).

    • dennisy a day ago ago

      Could you please expand on “none of them find the tools very useful”?

      I would love to know how big your sample is, in what way the tools fail, what features are missing etc.

  • sgt101 a day ago ago

    How to know if one should fine tune/pretrain or RL / reasoning train given some data set?

    • galsapir 20 hours ago ago

      i honestly dont think there's a simple y/n answer there - i think considerations include mostly like 'how costly it is to do so', 'how often do you think you'll need it', and so on. traces are not as "ephemeral" as FT models - since you can use those to guide agent behaviour when a newer model is released (but still, not as evergreen as other assets - traces generated using say GPT4 would seem pale and outdated compared to ones created on the same dataset using Opus4.5 i reckon)

  • 2 days ago ago
    [deleted]