[art] Re: use of Emojis in Internet Standards
Rob Sayre <sayrer@gmail.com> Tue, 10 March 2026 22:37 UTC
Return-Path: <sayrer@gmail.com>
X-Original-To: art@mail2.ietf.org
Delivered-To: art@mail2.ietf.org
Received: from localhost (localhost [127.0.0.1]) by mail2.ietf.org (Postfix) with ESMTP id BFF37C7DCF76 for <art@mail2.ietf.org>; Tue, 10 Mar 2026 15:37:59 -0700 (PDT)
X-Virus-Scanned: amavisd-new at ietf.org
X-Spam-Flag: NO
X-Spam-Score: -2.098
X-Spam-Level:
X-Spam-Status: No, score=-2.098 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: mail2.ietf.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail2.ietf.org ([166.84.6.31]) by localhost (mail2.ietf.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id nIwYN28rUI4D for <art@mail2.ietf.org>; Tue, 10 Mar 2026 15:37:59 -0700 (PDT)
Received: from mail-pj1-x1033.google.com (mail-pj1-x1033.google.com [IPv6:2607:f8b0:4864:20::1033]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature ECDSA (P-256) server-digest SHA256) (No client certificate requested) by mail2.ietf.org (Postfix) with ESMTPS id 57E14C7DCF65 for <art@ietf.org>; Tue, 10 Mar 2026 15:37:59 -0700 (PDT)
Received: by mail-pj1-x1033.google.com with SMTP id 98e67ed59e1d1-358e3cc5e7eso6673579a91.0 for <art@ietf.org>; Tue, 10 Mar 2026 15:37:59 -0700 (PDT)
ARC-Seal: i=1; a=rsa-sha256; t=1773182278; cv=none; d=google.com; s=arc-20240605; b=frHIBqjcNQ6EidVTr6uuLKG7h0VGBhgc0nYSJQococSWyaFpxGmvXIr3gFSiCVY1rE Otsc73RQQMpxYjlAp2FGhxT0Ne12tF1xILIt2/UjQ061Fa8krjW0h9Ha/eIvbE4wbzWR T4ONcuZVriK8L9UZfwtM6p7nmtkZX3RxCvPBqNHkuUXzx8ZxRDr9gy3vyAkf9R+R+mgO Yck1Zb2Bj+dTieuhtd0owh5wc/oUUs1fKqwbuOyqRXX9158++uusju8pV/j71byFCqg0 Dgqo0Gy8D3VLv/bH1KP1GG2bPG/WGVNh5jQS/Bo+HQFyflvN9fyfGw2RObiehOLfnMfo YWVA==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20240605; h=to:subject:message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=YTvJKvsFNDj1p1e1Woncvsfh3A3g7qKJxlf+4K6ZhmQ=; fh=n5sze2q3Hiv5wl3OwBbyb7z8HfGpnXVdpjhzZ8gCiK8=; b=ce4AS8loPpjmpyAHr45WF3yFSZNgAu0O2Kf6eL2lcsLZFX7F7kqbWeDceY1cgV4MjE MzmTI+l1RshWoXG8w/jtI4HGzMZGbn3f8Y+yxrTwgXcEXx2wIqoBd+JWtb8k204VuHxa wo4giuiLPIZU8EycGs9r3nVcGKS9xf7mjIrszJcbRHvH0kj3wc+7Rx5ppCdKlpG+dO9M 7MdNYvueIlv01atFJ7ilAnxCJFzu4uJwVPUX3OxQAaDcOXnWw5yiRIDv8TdSJw/tb+6v MZt8VlBN2tYdMDupl6aLHQPR04n3CckdHDCdApU6e6TEP8vxB5ILsFy04IQ72WvwM4Rb o8YA==; darn=ietf.org
ARC-Authentication-Results: i=1; mx.google.com; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1773182278; x=1773787078; darn=ietf.org; h=to:subject:message-id:date:from:in-reply-to:references:mime-version :from:to:cc:subject:date:message-id:reply-to; bh=YTvJKvsFNDj1p1e1Woncvsfh3A3g7qKJxlf+4K6ZhmQ=; b=YEPZnxgcpGVKmJkgiTGgXItvFuP8EcwpgzgIUthfoS0civJR6yrRxyVt5wuNs6pQe9 tRSlc9BjtVN9OeBHqDzLLxd6hvTRfHxYKysWWcNMjAE8iZj5LP4/Q3uLBLgNtgc3S71G 26iXfT4pNL/at++Pg9m98Ptvx+pdW4d8h9izViita1tqn7z4tEVleXrvslHKuGRx5L85 iOevbVmnPiKMEGeZxZw9Zw17UNUbaZTPjSAJGHtDeWR+COqDsoz/bQ4C0ieu6QzN5oxH feznnn3bWhofbMtp5B1wZPPpHSb5u79iwVAnmbYnI6V8Fy4hlpQ0/b0ksv6A06SZ5IJX fjBQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1773182278; x=1773787078; h=to:subject:message-id:date:from:in-reply-to:references:mime-version :x-gm-gg:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=YTvJKvsFNDj1p1e1Woncvsfh3A3g7qKJxlf+4K6ZhmQ=; b=J23Z6hEqANF/6oSNhqiCr0ReYoWH0e3V3UbDIIntwFJW6OxDHL7qphkZK4WveqVolv vipBwMNV2/AtBN0ZWbUkRjy7eKWEoVG2ayZkiSonDN8SBJ4itGgrWdrkb5zDkQHvVngQ x8H+1S2EaULYLMlLsXmUtnVvLRFSfHXKBSY4Y7VNyL7eRxgMhW+mbF6FOyFEhX3SItOV dVv4R3vKCaJmfYdwInoTRJYfVPAYXdD14GaAVprJPpqLasD0DFgUiW9me36VFnJZe6+t FpABOofg/GP4Rp5Vm8XVLsR0YjXdfj5cqjxcrpjdRkvUA/FjsEg7mkA8ef3fY7uirGVP ZXiA==
X-Gm-Message-State: AOJu0YwgzyDdbU0rySlSFi1MlzbEY2aJT69f47YDn+MiDD87TqSkpvUU YW/iorDdtfnmbkmAGwVZt5sNLoVfP4Ov39NJZJPcRr0AYEwu89pO583eokzj2aOhkauDM3bfhxl wqm90jNP6IM3K2FcWsxHTh8LRokvRoyx4sjnf
X-Gm-Gg: ATEYQzzn1mPjbgSjDkKtjoLVtq+tNKGksqvciQ9aTZVttr7+1cxuZq7s5IdqvrBgifp 6DrvJ7nFAGWbJr9VV6q4Zv/TB3EZcMOISCVNmTVFVYjAopnHLyKDTOfPgyGHh2fvdKmsb/bLaIV FK2RRtmZhw+MmWvkTHhnyaeRajOSpBTM2oIsI/P40twZ60Wsgh+wuileBDxNCBFIEG/JYdt4H0x fkEeZ7aTgQJo9C1gtTQkCuNIplW/DnO/iOgnOmlSe0ZUp9q/JA7imJ4QLze3m4rGIi+Mgn7+PW4 TJ3+H6rKv37Bf5bguCKIMiNfAaNZdYmNrAGw3h6z
X-Received: by 2002:a17:90b:3d0e:b0:359:ff8a:ee47 with SMTP id 98e67ed59e1d1-35a011a7baemr453634a91.6.1773182278100; Tue, 10 Mar 2026 15:37:58 -0700 (PDT)
MIME-Version: 1.0
References: <CAFwChB=OJUVOtGwGHP=8yzV8_NN9Q24zg+YB=UoJt-MH7RDFyg@mail.gmail.com> <CAFwChB=amTWoVJc+bQHm3QsQagogt=p7u_s8W43RPj-5hcUbHg@mail.gmail.com> <CAChr6SxgZEddsE=xADcb+5Okr+VwssiyOpHHTmD5hGoNNTrpBw@mail.gmail.com> <tdtuptotjoc6aoz3b4iay4dzegrt73ojoehcnl7lqe6pfs35fn@quo4bfnewxjo> <CAChr6SzZyRry4J=HiybPYiufpXbqr2QcAV8LGCYaKA4oiKd9Zg@mail.gmail.com> <p3jpdb2yar3uaifyy55iuivcjostgnhncn6shgoxjssbt474dn@rmkf2q5djd3c>
In-Reply-To: <p3jpdb2yar3uaifyy55iuivcjostgnhncn6shgoxjssbt474dn@rmkf2q5djd3c>
From: Rob Sayre <sayrer@gmail.com>
Date: Tue, 10 Mar 2026 15:37:47 -0700
X-Gm-Features: AaiRm52OmTOR3mQYokhpIN9_GqfqtycbFqkiUOIiNf4_nkrfXHD_CCeaNwXtO8w
Message-ID: <CAChr6SyTWYwQ99mxMMFFodSW6N3zF3FcAPoXpCfmg51fD+EhRA@mail.gmail.com>
To: art@ietf.org
Content-Type: multipart/alternative; boundary="000000000000ceeda7064cb32bb4"
Message-ID-Hash: RBNMGZXJD5AJK3OWNLHFYGLEX4ZDCCAP
X-Message-ID-Hash: RBNMGZXJD5AJK3OWNLHFYGLEX4ZDCCAP
X-MailFrom: sayrer@gmail.com
X-Mailman-Rule-Misses: dmarc-mitigation; no-senders; approved; emergency; loop; banned-address; member-moderation; header-match-art.ietf.org-0; nonmember-moderation; administrivia; implicit-dest; max-recipients; max-size; news-moderation; no-subject; digests; suspicious-header
X-Mailman-Version: 3.3.9rc6
Precedence: list
Subject: [art] Re: use of Emojis in Internet Standards
List-Id: Applications and Real-Time Area Discussion <art.ietf.org>
Archived-At: <https://mailarchive.ietf.org/arch/msg/art/xJL7VoQLAm34PDY1pi3Av5kKu4A>
List-Archive: <https://mailarchive.ietf.org/arch/browse/art>
List-Help: <mailto:art-request@ietf.org?subject=help>
List-Owner: <mailto:art-owner@ietf.org>
List-Post: <mailto:art@ietf.org>
List-Subscribe: <mailto:art-join@ietf.org>
List-Unsubscribe: <mailto:art-leave@ietf.org>
On Tue, Mar 10, 2026 at 3:33 PM Andrew Sullivan <ajs@anvilwalrusden.com> wrote: > > > https://github.com/sayrer/twitter-text/blob/main/rust/parser/src/twitter_text_full_tld.pest > > > >where every single emoji in Unicode 17 is described by hand. > > Great, but since that seems to be part of a general parser of tweets > (which last I looked are mostly arbitrary text) > This is the one point we must disagree on. Tweets are free text that contain identifiers (usernames, hashtags, etc). That's why the parser is brutal. I'd encourage anyone interested in this topic to read mine, and feel free to improve on it. thanks, Rob
- [art] use of Emojis in Internet Standards Robert Viragh
- [art] Re: use of Emojis in Internet Standards Ted Hardie
- [art] Re: use of Emojis in Internet Standards Salz, Rich
- [art] Re: use of Emojis in Internet Standards Orie
- [art] Re: use of Emojis in Internet Standards Rob Sayre
- [art] Re: use of Emojis in Internet Standards Tim Bray
- [art] Re: use of Emojis in Internet Standards S Moonesamy
- [art] Re: use of Emojis in Internet Standards Andrew Sullivan
- [art] Re: use of Emojis in Internet Standards Carsten Bormann
- [art] Re: use of Emojis in Internet Standards Patrik Fältström
- [art] Re: use of Emojis in Internet Standards Rob Sayre
- [art] Re: use of Emojis in Internet Standards Rob Sayre
- [art] Re: use of Emojis in Internet Standards Andrew Sullivan
- [art] Re: use of Emojis in Internet Standards Nico Williams
- [art] Re: use of Emojis in Internet Standards Nico Williams
- [art] Re: use of Emojis in Internet Standards Andrew Sullivan
- [art] Re: use of Emojis in Internet Standards Rob Sayre
- [art] Re: use of Emojis in Internet Standards Martin J. Dürst
- [art] Re: use of Emojis in Internet Standards Rob Sayre
- [art] Re: use of Emojis in Internet Standards Rob Sayre
- [art] Re: use of Emojis in Internet Standards Rob Sayre
- [art] Re: use of Emojis in Internet Standards John C Klensin
- [art] Re: use of Emojis in Internet Standards Martin J. Dürst
- [art] Re: use of Emojis in Internet Standards Rob Sayre
- [art] Re: use of Emojis in Internet Standards Rob Sayre
- [art] Re: use of Emojis in Internet Standards Andrew Sullivan
- [art] Re: use of Emojis in Internet Standards Martin J. Dürst