[prev in list] [next in list] [prev in thread] [next in thread] 

List:       kde-kimageshop
Subject:    Re: Licensing for models and datasets
From:       Tymon_Dąbrowski <tamtamy.tymona () gmail ! com>
Date:       2024-03-25 16:43:09
Message-ID: CAL5LU_-Ng4zKaWCyuUZ3FQM1=xVHtLRb4JkG8FnPbNtSFHJDyA () mail ! gmail ! com
[Download RAW message or body]

> If it's just links and metadata, then one of the various CCs is fine.
> Some popular datasets include entire images, in which case... I don't
know. I'd avoid that...
Well, we'll have the actual data, not just links. We might not release it
if we don't want to, though.

> 2.) If we don't own the data used to "train" the binary blob model, do we
even own the model?
I'd just ask the artist to license it all on CC-0, then we can use it
however we want. (CC-BY could be already too much since you could argue
that the model is the derivative of data, and the final picture a
derivative of the model, therefore derivative of the data).

Remember that we don't have those images or dataset yet and we can just
choose those that fit our needs, including the licensing.



pon., 25 mar 2024 o 17:33 Emmet O'Neill <emmetoneill.pdx@gmail.com>
napisał(a):

> 1.) Does the dataset contain full, original training images or just plain
> text links to images?
>
> If it's just links and metadata, then one of the various CCs is fine.
> Some popular datasets include entire images, in which case... I don't
> know. I'd avoid that...
>
> 2.) If we don't own the data used to "train" the binary blob model, do we
> even own the model?
>
> Obviously only the owner of some work can license it to others.
> We're kind of in legal no-man's land with all this stuff, so I don't know
> and I don't expect you to know either, but it feels like due diligence to
> consider it.
> Does Intel have any suggestions about this? What do they do?
>
> Even doing my best to put aside my personal (well-documented) feelings on
> copyright and generative AI aside, I don't really understand the
> legal/licensing mechanics of all this stuff to help you make a good
> judgement here.
> If nothing else I'm curious to see where this stuff leads Krita.
>
> On Mon, Mar 25, 2024 at 7:17 AM Halla Rempt <halla@valdyas.org> wrote:
>
>> We're looking into adding an experimental AI-based feature to Krita:
>> automated inking. That gives us three components, and we're not sure about
>> the license we should use for two of them: the model and the datase. Would
>> CC be best here?
>>
>> Halla
>>
>>
>>

[Attachment #3 (text/html)]

<div dir="ltr"><div dir="ltr">&gt; If it&#39;s just links and metadata, then one of \
the various CCs is fine.<div>&gt; Some popular datasets include entire images, in \
which case... I don&#39;t know. I&#39;d avoid that...<br></div><div>Well, we&#39;ll \
have the actual data, not just links. We might not release it if we don&#39;t want \
to, though.<br><br>&gt; 2.) If we don&#39;t own the data used to &quot;train&quot; \
the binary blob model, do we even own the model? <br></div><div>I&#39;d  just ask the \
artist to license it all on CC-0, then we can use it  however we want. (CC-BY could \
be already too much since you could argue  that the model is the derivative of data, \
and the final picture a  derivative of the model, therefore derivative of the \
data).<br></div><div><br></div><div>Remember that we don&#39;t have those images or \
dataset yet and we can just choose those that fit our needs, including the \
licensing.<br></div><div><br><br></div></div><br><div class="gmail_quote"><div \
dir="ltr" class="gmail_attr">pon., 25 mar 2024 o 17:33  Emmet O&#39;Neill &lt;<a \
href="mailto:emmetoneill.pdx@gmail.com">emmetoneill.pdx@gmail.com</a>&gt; \
napisał(a):<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px \
0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div \
dir="ltr"><div>1.) Does the dataset contain full, original training images or just \
plain text links to images? <br></div><div><br></div><div>If it&#39;s just links and \
metadata, then one of the various CCs is fine.</div><div>Some popular datasets \
include entire images, in which case... I don&#39;t know. I&#39;d avoid \
that...<br></div><div><br></div><div>2.) If we don&#39;t own the data used to \
&quot;train&quot; the binary blob model, do we even own the model?  \
</div><div><br></div><div>Obviously only the owner of some work can license it to \
others.</div><div>We&#39;re kind of in legal no-man&#39;s land with all this stuff, \
so I don&#39;t know and I don&#39;t expect you to know either, but it feels like due \
diligence to consider it.</div><div>Does Intel have any suggestions about this? What \
do they do?<br></div><div><br></div><div>Even doing my best to put aside my personal \
(well-documented) feelings on copyright and generative AI aside, I don&#39;t really \
understand the legal/licensing mechanics of all this stuff to help you make a good \
judgement here.  </div><div>If nothing else I&#39;m curious to see where this stuff \
leads Krita.<br></div></div><br><div class="gmail_quote"><div dir="ltr" \
class="gmail_attr">On Mon, Mar 25, 2024 at 7:17 AM Halla Rempt &lt;<a \
href="mailto:halla@valdyas.org" target="_blank">halla@valdyas.org</a>&gt; \
wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px \
0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">We&#39;re looking into \
adding an experimental AI-based feature to Krita: automated inking. That gives us \
three components, and we&#39;re not sure about the license we should use for two of \
them: the model and the datase. Would CC be best here?<br> <br>
Halla<br>
<br>
<br>
</blockquote></div>
</blockquote></div></div>



[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic