Re: [Uri-review] Request for Review and Registration of New URI Scheme "ai"

IS4 <is4.site@gmail.com> Sun, 31 December 2023 20:16 UTC

Return-Path: <is4.site@gmail.com>
X-Original-To: uri-review@ietfa.amsl.com
Delivered-To: uri-review@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 99F40C14F5F8 for <uri-review@ietfa.amsl.com>; Sun, 31 Dec 2023 12:16:31 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.106
X-Spam-Level:
X-Spam-Status: No, score=-2.106 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id bZe6obapJGML for <uri-review@ietfa.amsl.com>; Sun, 31 Dec 2023 12:16:27 -0800 (PST)
Received: from mail-ed1-x531.google.com (mail-ed1-x531.google.com [IPv6:2a00:1450:4864:20::531]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 7746AC14F602 for <uri-review@ietf.org>; Sun, 31 Dec 2023 12:16:27 -0800 (PST)
Received: by mail-ed1-x531.google.com with SMTP id 4fb4d7f45d1cf-555bd21f9fdso2170584a12.0 for <uri-review@ietf.org>; Sun, 31 Dec 2023 12:16:27 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1704053785; x=1704658585; darn=ietf.org; h=content-transfer-encoding:in-reply-to:from:references:to :content-language:subject:user-agent:mime-version:date:message-id :from:to:cc:subject:date:message-id:reply-to; bh=qqOFd5Ak0w2AE2bedPSejOBVJecbQQ70Ltq26yrfjDw=; b=PqWk2CmOI8699Xr5UFxvRzKYcqploFdbVmVY2LOcpLh6ljBAKRvhchq5P+vj2TgOwq Q1P/oLqJ5Qt0tFcTj2BGtgXqcKgWHdz9i67rF+q5VN0KNIT77X90NizIzIdjPa3kzVSH M+CjdxoGJLC8EthW60I1Z3Exra7ZJdrkjCzY1EE7d37TnYzgIhhLcyf4es15LAXWpVIQ 6jx4fKubqe7LANBHazDqE7xjzDuFRvtKb6wEVmR9SWzUOGB4g5k2uI06KHeVJeNsKm0q r0XQnfXjVcT5o/nIGUvYtDAHKH2+n0FF0FTaenFZbxsNJH7GNrVhf6OkcesXdU8ea5dv tS8g==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1704053785; x=1704658585; h=content-transfer-encoding:in-reply-to:from:references:to :content-language:subject:user-agent:mime-version:date:message-id :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=qqOFd5Ak0w2AE2bedPSejOBVJecbQQ70Ltq26yrfjDw=; b=Z3wAaoGHEctvRTcWHUpk3Qo90wCLJngo/uNflBzoyd/kN6a+tmPFivCYichqhELJAr 5xXGUs9YY7TfnFkVLRp/oM3kU0b1cpGi4GhYCqqmz+jpiA6Gaq20zcETgwJw0TqvdiHl ZgccIsywNbW3wMIB+PCzrykPwPsPSFnymMjc8V32yaA5LP1CuBKY1K9Y1/X7Th6fk1oy DWEkhsU5W6k51zh66bULKTz4ADjpsKgasHT4HqjKpM8WvFq6jeVYE8YeZYevhgzS6ZW+ KUEgALrytyyVcTTC/S59vp+Ej/+Csn0BREdYDXkro9r6TBImNM0vBlBGBGQMzOVc+HuC xXhg==
X-Gm-Message-State: AOJu0YwpaZo3clKDqIJh3yLvOIhbFQCzSCcpNkvMDDaqI4tAOKrwOR4A oSuaeaJvN1huaV5PxoJDJof8takN8x1QgQ==
X-Google-Smtp-Source: AGHT+IHWK1hcZGItyyIWF+sLCUwc2ujFbXdIRv54Oara2M5Xe4mCQHmgscltab3ArI5tPyHuAeLNSQ==
X-Received: by 2002:a50:c259:0:b0:556:5bb6:ff5b with SMTP id t25-20020a50c259000000b005565bb6ff5bmr11994edf.12.1704053785220; Sun, 31 Dec 2023 12:16:25 -0800 (PST)
Received: from [192.168.50.35] (238-134.mason-net.cz. [46.30.238.134]) by smtp.gmail.com with ESMTPSA id l7-20020a056402344700b005561c3997c6sm1599938edc.9.2023.12.31.12.16.24 for <uri-review@ietf.org> (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Sun, 31 Dec 2023 12:16:24 -0800 (PST)
Message-ID: <8ee73eae-1b2f-4f3f-ba39-3d78cac8d12b@gmail.com>
Date: Sun, 31 Dec 2023 21:16:23 +0100
MIME-Version: 1.0
User-Agent: Mozilla Thunderbird
Content-Language: cs
To: uri-review@ietf.org
References: <2047091996.519294.1696656878248@privateemail.com> <93ee3113-0ab2-7124-678b-cc523571e613@it.aoyama.ac.jp> <1258868613.683466.1696903560871@privateemail.com> <d6e20b29-f03b-4a7d-eddb-feee1dedcaf7@ninebynine.org>
From: IS4 <is4.site@gmail.com>
In-Reply-To: <d6e20b29-f03b-4a7d-eddb-feee1dedcaf7@ninebynine.org>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: 8bit
Archived-At: <https://mailarchive.ietf.org/arch/msg/uri-review/K69IciJJOLfuphogpM_ywmVLwtY>
Subject: Re: [Uri-review] Request for Review and Registration of New URI Scheme "ai"
X-BeenThere: uri-review@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Proposed URI Schemes <uri-review.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/uri-review>, <mailto:uri-review-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/uri-review/>
List-Post: <mailto:uri-review@ietf.org>
List-Help: <mailto:uri-review-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/uri-review>, <mailto:uri-review-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 31 Dec 2023 20:16:31 -0000

Hello,

with all due respect to Collin (and pardon me if I am mistaken), I have 
the feeling his responses were written partially or fully by GPT (based 
on my experience, but https://gptzero.me/ has the same feeling) which 
might explain why they are written in a bit generic style and it is hard 
(at least to me) to understand the content.

That being said, I do find value in AI-related URI schemes, and I can 
imagine some that would fit the purpose of URIs better, such as for 
generative AI. As an example:

text.gen:?prompt=Give+me+a+list+of+10+reasons+why+URI+schemes+are+good&model=gpt-4

img.gen:?prompt=cat+sitting+on+a+table&model=stable-diffusion-v2

This clearly indicates the actual resource identified by the URI, which 
is the text/image generated by a particular model. If a compatible 
software is installed, it could automatically load the prompt and other 
parameters and generate the output, or allow the user to tweak it. It 
could be used in a "source URI" metadata attribute for images or 
documents to encode how they were generated.

The issue is finding a common ground for every generative AI situation 
that there is. The prompt is the shared parameter, but it may not be a 
single string but a collection of strings or even structured messages 
(e.g. with OpenAI chat-based roles "user", "system", "assistant"), there 
may be a "negative prompt" too to adjust the weights in the opposite 
direction, both of which are affected by a particular tokenization or 
preprocessing mechanism, which might be bypassed if the tokens are 
provided directly instead of text as the prompt. The model has to be 
identified somehow too, not only the actual technology (GPT, Stable 
Diffusion) but also the encoded weights (one common way is through a 
hash, e.g. urn:sha256:...), and then there is the myriad of 
technology-specific parameters, or even other external resources 
necessary for generating the proper output (such as images or masks for 
img2img). Maybe this is the right time to come up with a baseline 
scheme, but it would definitely have to be very extensible to be 
considered in future technologies.

I have started a discussion for Stable Diffusion 
<https://github.com/AUTOMATIC1111/stable-diffusion-webui/discussions/14405> 
to see if there is a need for such a scheme in the first place.

IS4

On 24.10.2023 17:12, Graham Klyne wrote:
> Hi Collin,
>
> I'm struggling to understand what is being proposed.  I have some 
> questions, which somewhat overlap with concerns raised by Ted and Martin:
>
> 1. Would the intended goals be better served by a MIME content type 
> registration, since the proposal seems to be about the nature of the 
> data transmitted rather than the means of transmission?  I am really 
> not sure; e.g., see (4) below.  For example, considering "The `ai` 
> scheme is aimed at enabling real-time or near-real-time communication 
> for streaming data among generative Artificial Intelligence (A.I.) 
> applications and protocols.", I'm wondering if the existing ws: 
> protocol coupled with new content-types specific to AI models might 
> serve your needs.  With reference to "novel communication paradigms", 
> I am also wondering if elements of RTSP might be applicable.
>
> 2. Why does this need to be a URI?  In what contexts might it be used 
> where some other URI scheme might also be used?  If there are none, 
> isn't it just a private-use string that happens to look like a URI?
>
> 3. What resource is actually identified by such a URI?  Most URI 
> schemes are associated with some kind of mechanism (executable or 
> administrative or something else) that associates a URI with some 
> resource that it denotes.  Some schemes are associated with particular 
> wire protocols (http, https, ftp, ws, etc.), where the identified 
> resource is something that can be interacted with using said 
> protocol.  Other schemes are associated by some non-executable 
> mechanism, such as administrative delegation (e.g. urn) or as a 
> constructed unique identifier (e.g. ni). In thinking about these 
> questions, a touchstone I sometimes use is: what do I expect to happen 
> if this URI is dereferenced (e.g., when used as an @href link in an 
> HTML document, and the link is clicked)? Commonly, what one expects is 
> that some representation of the identified resource may be returned.
>
> 4. What kinds of interaction with identified resources are possible or 
> expected?  Are there interactions that are specific to AI applications 
> that aren't allowable or implementable using existing schemes?
>
> There don't need to be definitive answers to these questions, but I 
> find it helps to think about such issues to understand how a new URI 
> scheme might sit with the wider World Wide Web architecture.
>
> #g
>
>
> On 10/10/2023 03:06, Collin Paran wrote:
>> Ted and Martin,
>>
>> I sincerely appreciate the detailed feedback from both of you! Your 
>> insights are invaluable in refining the "ai" URI scheme proposal to 
>> ensure it aligns with established guidelines and serves its intended 
>> purpose effectively. I have taken the time to review a few of the 
>> concerns raised and provided responses below:
>>
>> 1. General Data vs. AI Data:
>>      - I understand the point Martin has raised regarding the 
>> universality of protocols like HTTP and WS, which are agnostic to the 
>> type of data they carry. The primary motivation behind proposing a 
>> specific URI scheme for AI data is to facilitate specialized 
>> communication needs of generative AI applications, especially with 
>> the advent of AI photonic compute and laser-based communications 
>> which could offer novel communication paradigms.
>>
>> 2. Applicability Beyond AI:
>>      - I agree with the suggestion that if a new protocol surfaces 
>> from the AI communication needs, it should not be limited to AI but 
>> should be useful for other applications as well. I will reconsider 
>> the naming of the URI scheme to reflect a broader scope.
>>
>> 3. Authority Information:
>>      - The initial proposal lacked authority information, which is 
>> crucial for identifying the party making the data available and 
>> ensuring data from multiple parties can be accommodated. I will work 
>> on including authority information in the URI scheme to address this 
>> concern.
>>
>> 4. Scheme Name:
>>      - The suggestion to consider a more descriptive or unique scheme 
>> name like "wsai:" or "aiws:" is well taken. I will explore 
>> alternative naming options that encapsulate the essence of the 
>> protocol while not being overly restrictive to AI.
>>
>> 5. Grammar and Security Considerations:
>>      - As earlier mentioned in response to Ted's feedback, I will 
>> work on providing ABNF definitions for the various components of the 
>> URI scheme, and outline the security considerations as required by 
>> RFC 7595.
>>
>> 6. Engagement with the Community:
>>      - I am open to further discussions and feedback from the 
>> community to ensure the proposal meets the necessary standards and 
>> caters to the intended use cases effectively. I'm committed to making 
>> any necessary revisions based on the community's feedback.
>>
>> 7. Registry and Token Standardization:
>>      - In line with Ted's feedback, I'm considering the establishment 
>> or adoption of registries for common models, actions, and AI types to 
>> foster common understanding and ensure the URI scheme serves its 
>> purpose effectively.
>>
>> I am looking forward to further discussions on the mailing list and 
>> appreciate the time and effort you both have put into reviewing the 
>> proposal and providing insightful feedback.
>>
>> Thanks again.
>>
>> Respectfully,
>>
>> Collin
>> collin@professai.com
>>
>>> On 10/09/2023 5:28 PM MDT Martin J. Dürst <duerst@it.aoyama.ac.jp> 
>>> wrote:
>>>
>>>   Hello Collin, others,
>>>
>>> A few more comments on top of Ted's, written in a somewhat more direct
>>> style.
>>>
>>> It's unclear to me why AI data is in any way special. Protocols such as
>>> HTTP or WS don't care what data they carry, and data shouldn't care 
>>> what
>>> protocols it's been carried by. As an example, Web Sockets is used (I
>>> guess) for search results, for stock price data, for gaming data, 
>>> and so
>>> on, but there's no need for anything other than a ws: scheme. Same 
>>> for http.
>>>
>>> On the other hand, if AI would surface the need for a new protocol,
>>> there's no need to limit this to AI; there's a high chance that such a
>>> protocol may be useful for other applications, too. In that case, the
>>> scheme name "ai" would be inappropriate.
>>>
>>> Your scheme proposal doesn't contain any kind of authority information
>>> (that would usually be some domain name). So it's unclear how data 
>>> could
>>> be made available by multiple parties, or how the party making the data
>>> available could be identified. This is different both from
>>> "type-of-generative-ai" and from "specific-model".
>>>
>>> [Of course, it might be the case that this is only intended for the use
>>> by a single party providing data, but that's not what URI schemes 
>>> are for.]
>>>
>>> If indeed there is a need for an application-specific URI scheme for 
>>> AI,
>>> that may not be limited to your proposal. It would therefore be
>>> premature to use such a short scheme name for this specific proposal;
>>> maybe "wsai:" or "aiws:" or something along these lines would be more
>>> appropriate (assuming that there's a need for an application-specific
>>> URI scheme at all, which I strongly doubt (see my first point)).
>>>
>>> Regards,   Martin.
>>>
>>> On 2023-10-07 14:34, Collin Paran wrote:
>>>> Dear IANA and URI Review Mailing List Members,
>>>>
>>>> I am writing to request a review and registration of a new URI 
>>>> Scheme, "ai", designed for facilitating communication in generative 
>>>> A.I. applications. The scheme is inspired by WebSockets and allows 
>>>> the passing of JSON data for streaming words, video data, or images 
>>>> between generative A.I. entities.
>>>>
>>>> Please find below the URI Scheme Registration Template as per the 
>>>> guidelines provided in RFC 4395.
>>>>
>>>> URI Scheme Name: ai
>>>>
>>>> Status: Provisional
>>>>
>>>> URI Scheme Syntax:
>>>>       * 
>>>> `ai://<type-of-generative-ai>/<specific-model>/<action>?<parameters>`
>>>> URI Scheme Semantics:
>>>>       * The `ai` scheme is aimed at enabling real-time or 
>>>> near-real-time communication for streaming data among generative 
>>>> Artificial Intelligence (A.I.) applications and protocols.
>>>> Encoding Considerations:
>>>>       * UTF-8 encoding for multilingual support and binary format 
>>>> for multimedia data encapsulated in JSON objects.
>>>> Applications/Protocols that use this URI scheme name:
>>>>       * Generative A.I. applications and protocols requiring 
>>>> real-time or near-real-time communication for streaming data.
>>>> Interoperability Considerations:
>>>>       * Compatibility with existing WebSocket implementations and a 
>>>> standard JSON format for data communication.
>>>> Security Considerations:
>>>>       * Usage of Artificial Intelligence Secure (AIS) which is also 
>>>> interoperable with wss (WebSocket Secure) for encrypted 
>>>> communication and authentication mechanisms for authorized access.
>>>> Contact:
>>>>       * Collin Paran, collin@professai.com mailto:collin@professai.com
>>>> Author/Change Controller:
>>>>       * Collin Paran, collin@professai.com mailto:collin@professai.com
>>>> References:
>>>>       * Pending public release.
>>>> I am looking forward to receiving feedback from the community and I 
>>>> am open to making any necessary revisions to ensure the `ai` scheme 
>>>> aligns with the established guidelines and standards. I am also 
>>>> eager to engage in discussions on the `uri-review@ietf.org` mailing 
>>>> list and other relevant forums as appropriate.
>>>>
>>>> Thank you for your time and consideration.
>>>>
>>>> Warm regards,
>>>>
>>>> Collin Paran
>>>> collin@professai.com
>>>>
>>>>
>>>> _______________________________________________
>>>> Uri-review mailing list
>>>> Uri-review@ietf.org
>>>> https://www.ietf.org/mailman/listinfo/uri-review
>> _______________________________________________
>> Uri-review mailing list
>> Uri-review@ietf.org
>> https://www.ietf.org/mailman/listinfo/uri-review
>