Wikifunctions:Suggest a function

From Wikifunctions

Do you have an idea for a new function? Suggest it here! It may help to refer to our glossary.

You can go and create a function right away if you have the user-rights, and it aligns with other work.

Note that for now we only support strings, Booleans, Natural numbers, and lists as input and output types of functions. More types are coming in the next few months.

Once created, consider adding new Functions to the catalogue.

Proposed functions requiring only available types (string, Boolean, Natural number, list)

String

String character discard functions

String character replacement functions

String search functions

String escaping and unescaping functions

String encoding and decoding functions

String presentation functions

String colour notation functions

String notation validation checks

String validation checks

String analysis functions

Monolingual text

String Wikitext operations

...

Natural number

List

Basic list/iterable functions

  • right fold (Z12753): combine the first element with the result of recursively combining the rest, according to a combining function
  • left fold (Z12781): combine the result of recursively combining the rest, with the last element, according to a combining function
  • group

Complex list functions

  • all distinct permutations (Z12745): returns a list of all distinct permutations of the input list, each is itself a list
  • zip lists together: for [ A .. Z ] and [ 1 .. 26 ] return [ [ A, 1 ], [ B, 2 ], .. ]
    • Unsure what happens if input lists are of different lengths.
    • If possible this function should be able to zip more than 2 lists together... 3, 4, n? Perhaps the input should be list(list, list, list, list, ..).

CSV list operations

  • Done csv to list of strings (Z12794): Converts a validly formatted (RFC 4180) comma-separated value series into a valid list of strings (not including the commas or row start or row end characters), where any whitespace and quotes are unchanged. Be careful to validly render CSV with quoted fields.
  • list of strings to csv

Functions with functions as arguments

  • Reduce Function (Z876): iterates the application of a two-parameter function, the first parameter uses the initial object or the previous result, the second parameter uses the next item on the list
  • right fold (Z12753): combine the first element with the result of recursively combining the rest, according to a combining function
  • left fold (Z12781): combine the result of recursively combining the rest, with the last element, according to a combining function

Morphological functions

morphology is the part of linguistics that studies how language parts are 'shaped' and change diachronically and when inflected. Hausa, Igbo, Malayalam, Bangla/Bengali and Dagbani are focus languages for Wikidata's lexicographic dataset, which is an important aspect of Abstract Wikipedia.

mul - Multiple languages

ase - American Sign Language

  • string: Stokoe to ase-Sgnw and vice-versa (consult @Slevinski: as to best approach)

bn - Bangla

cy - Welsh

w:en:colloquial Welsh morphology

dag - Dagbani

de - German

en - English

  • English verb to agent noun (Z11390) Verb -> agent noun, e.g. "dance"->"dancer"
  • Done English nominative to accusative (Z11651): converts a nominative (subject) pronoun to the accusative (objective) case
  • Done suffix English word (Z13254): Add any suffix to an English word with regular changes to spelling
  • Join English morphemes (extends suffix English word (Z13254) to cases like re + en + able + er + s → re-enablers. suffix English word (Z13254) will correctly join re-enable + ers or re- + enablers, but re + enablers → “renablers” (incorrect). English morpheme agglutination (Z13275) tests the Reduce function to produce “detoxification” from a list of four morphemes (orchestrator limit exceeded with five). I doubt we’ll want to derive “toxify” from “toxic”, however.
  • Derive lemmas from a form. This is envisaged as the converse of Join English morphemes. The focus would be identifying the base form (the lexeme’s lemma) rather than further segmenting the lemma. For example, “underlay” should return “underlie” (for which it is the past participle) and the noun “underlay” (for which it is the lemma) and (perhaps) the verb “underlay”, which might be the tendency of an unproductive hen or the activity of a carpet-fitter. As this is a purely functional converse, every string will have itself as a possible lemma.
  • Generate Numerical prefixes of various kinds from a natural number input.

fr - French

ha - Hausa

A notated demo sentence ("Aishà taa jeefar dà kàren Indoo" ― "Aisha threw away Indo's dog") is available at http://intent.xigt.org

ig - Igbo

ldn - Láadan

section moved to WF:human languages/Z1882

ml - Malayalam

Proposed functions requiring future types

Note these functions cannot be implemented properly until the needed types are requested and approved.

If one wishes to nevertheless attempt to define and implement them,

  • the functions and implementations should indicate prominently in their labels that their input/output types must be adjusted once support for the appropriate replacement types become available; and
  • the functions should not be used in the implementations of any other functions, as the later adjustment of input/output types to appropriate replacements will break those implementations.

String manipulation functions

String analysis functions

  • count distance between two letters in given alphabet (default to 26-charcater western alphabet. case insensitive. e.g. "a" & "A" ⇒ 0; "K" & "N" ⇒ 3)
  • Done Hamming distance between two strings of equal length, e.g. "Wikipedia" & "Wikimedia" ⇒ 1. - hamming distance between two strings (Z11328)
  • Levenshtein distance between two strings (e.g. "kitten" to "sitting" => 3 : kitten > sitten > sittin > sitting)

String encoding and decoding functions

(would be better with types representing a stream of bytes)

Natural language functions

Cryptographic hash functions

(would be better with types representing a stream of bytes)

Colour functions

  • return colour contrast ratio (per [1]) of two RGB colours (provided as strings e.g. "#FF0000")

Date, time, and calendric functions

Note: 'time' type not yet supported, use 'string' (or for strictly numeric values, 'natural number')

Bengali calendar

Gregorian to Bangla (Z12926): Returns the Bangla equivalent date of a Gregorian date format. The input fields represent the Year, month, and day, respectively.

Chinese calendar

French Republican Calendar

decimalises and secularises the Gregorian

Gregorian

widely used calendar derived from the Julian, basis for ISO 8601

Holocene calendar

Indian national calendar

Islamic

a solar calendar, also called Hijri

Julian

mostly used by astronomers, some historians, and some Orthodox Christian denominations

Mesoamerican calendars

including civil and clerical forms

Persian

also called Jalali

Thai calendar

Hebrew calendar

Basic numerical functions

Data serialization functions

Basic list/iterable functions requiring numeric types

Geodetics functions

w:en:planetary coordinate system, w:en:well-known text representation of coordinate reference systems

Earth

  • convert coordinates outside of the ranges (-180, 180) for longitude and (-90, 90) for latitude to a canonical form

Mars

  • convert coordinates outside of the ranges [0, 360) for longitude and (-90, 90) for latitude to a canonical form

Unit conversion functions

External function lists