TensorFlow Similarity

JulianSMoore · 14 September 2021 16:00

As a general question, when TF introduces something like this (TensorFlow Similarity), how long might it be before it could be used within PL?

There are so many TF niches opening up, and even TF haven’t yet produced tensorflow_decision_forests for Windows (Linux only still?) because of other dependencies, so it’s clearly not necessarily trivial, but… just asking how responsive PL perhaps could be once it’s firmed up…

And as a question to all - did anyone see this coming? Previews in research etc.? I wonder what else is in the works - and where/how can one find out?

robertl · 15 September 2021 11:24

Cool stuff!
From a feature standpoint to enable this, it looks like there are two paths; either using the SimilarityModel() or adding all parts of it.

Adding the SimilarityModel() requires the model wrapper to change, which is currently behind the scenes. This would essentially mean a new type of workspace, where the training code “behind” the workspace differs. If you want to compare to our old workflow, this would mean a new training component.

Adding parts of it would require:

Adding the similarity loss function
Adding a way to do clustering in PL (maybe a new workspace dedicated for ensemble models, clustering, etc., kind of a deployment pipeline builder)

Our plans for better tackling these kind of things is to gradually start opening up the customization more and more.
Starting with custom losses, then custom trainer (enabling you to use SimilarityModel() for example) as well as custom data ingestion.
What we want to make sure this time though, is that they are modular, easy to edit and customize and finally and most importantly, save-able and share-able. The hope being that one person finds something cool, can implement it and share with the rest of the community without friction.