Sometimes it is useful to save intermediate chain state: because operations are lazy, chains are not executed immediately and intermediate results are not stored.
For example, if we want to create dc_filtered_1 and dc_embeddings from dc, then without saving the intermediate result the dc chain will be executed twice, once for each child.
This can already be done with the save() method without the name param, and we also have the exec() method, but persist() looks like a better and more descriptive name for this operation.
Once the persist() method is implemented, we may want to make the name param in save() mandatory.
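To make the double execution concrete, here is a minimal sketch using a toy `LazyChain` class. The class, its `map`/`filter`/`collect` methods, and the `expensive` helper are all hypothetical stand-ins for illustration, not the library's actual API; `persist()` is shown only as one way the proposed method could behave.

```python
class LazyChain:
    """Toy stand-in for a lazy chain (hypothetical, not the real library API)."""

    def __init__(self, source, steps=()):
        self._source = source        # zero-argument callable that yields the rows
        self._steps = tuple(steps)   # recorded operations, applied only on collect()

    def map(self, fn):
        return LazyChain(self._source, self._steps + (("map", fn),))

    def filter(self, pred):
        return LazyChain(self._source, self._steps + (("filter", pred),))

    def collect(self):
        # Runs every recorded step from scratch; nothing is cached.
        rows = self._source()
        for kind, fn in self._steps:
            if kind == "map":
                rows = [fn(r) for r in rows]
            else:
                rows = [r for r in rows if fn(r)]
        return rows

    def persist(self):
        # Executes the chain once and returns a new chain backed by the stored rows.
        rows = self.collect()
        return LazyChain(lambda: rows)


calls = {"expensive": 0}

def expensive(x):
    # Counts invocations so the repeated work is visible.
    calls["expensive"] += 1
    return x * 10

dc = LazyChain(lambda: [1, 2, 3]).map(expensive)

# Without persisting: each child re-runs the parent chain.
dc_filtered_1 = dc.filter(lambda x: x > 10).collect()
dc_embeddings = dc.map(lambda x: x + 1).collect()
calls_without = calls["expensive"]

# With persisting: the parent chain runs once and both children reuse its result.
calls["expensive"] = 0
dc_saved = dc.persist()
dc_filtered_1_p = dc_saved.filter(lambda x: x > 10).collect()
dc_embeddings_p = dc_saved.map(lambda x: x + 1).collect()
calls_with = calls["expensive"]
```

In this sketch the expensive step runs six times without `persist()` and three times with it, while both children produce the same rows either way.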
.persist() is the name of the method in the dataframe API standard. I think that's what we should use - assuming it works exactly as described in the standard.
Follow-up to #327