Generating Datasets With Pretrained Language Models
Task: Write two sentences that mean the same thing.
Model               | UD  | STS12 | STS13 | STS14 | STS15 | STS16 | STSb  | SICK  | Avg.
sup.
S-BERT (base)       | –   | 70.97 | 76.53 | 73.19 | 79.09 | 74.30 | 77.03 | 72.91 | 74.89
S-RoBERTa (base)    | –   | 71.54 | 72.49 | 70.80 | 78.74 | 73.69 | 77.77 | 74.46 | 74.21
unsup.
Avg. GloVe          | –   | 55.14 | 70.66 | 59.73 | 68.25 | 63.66 | 58.02 | 53.76 | 61.32
Avg. BERT           | –   | 38.78 | 57.98 | 57.98 | 63.15 | 61.06 | 46.35 | 58.40 | 54.81
BERT CLS            | –   | 20.16 | 30.01 | 20.09 | 36.88 | 38.08 | 16.50 | 42.63 | 29.19
Zhang et al. (2020) | NLI | 56.77 | 69.24 | 61.21 | 75.23 | 70.16 | 69.21 | 64.25 | 66.58
Li et al. (2020)    | NLI | 59.54 | 64.69 | 64.66 | 72.92 | 71.84 | 58.56 | 65.44 | 65.38
Li et al. (2020)    | STS | 63.48 | 72.14 | 68.42 | 73.77 | 75.37 | 70.72 | 63.11 | 69.57
DINO (STS-🎲-x1x2)  | –   | 64.87 | 78.30 | 66.38 | 79.60 | 76.47 | 76.51 | 74.26 | 73.77
DINO (STS-🎲-x2)    | STS | 70.27 | 81.26 | 71.25 | 80.49 | 77.18 | 77.82 | 68.09 | 75.20
Table 1: Spearman's rank correlation on STS12–16, STSb and SICK without finetuning on task-specific examples, for models with NLI supervision ("sup.") and fully unsupervised ("unsup.") models, using the same evaluation setup as Reimers and Gurevych (2019). The second column shows which unlabeled data ("UD") is used by unsupervised approaches in addition to the original pretraining data; the final column shows average performance. Results for all baselines except Zhang et al. (2020) and Li et al. (2020) are from Reimers and Gurevych (2019). The best unsupervised result is shown in bold, the best overall result is underlined. DINO outperforms all unsupervised approaches and, surprisingly, also supervised approaches on four out of six STS datasets.
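The evaluation protocol behind Table 1 can be sketched as follows. This is an illustrative reimplementation, not the authors' code: it assumes the standard setup of scoring cosine similarities between sentence embeddings against gold labels with Spearman's ρ, and the `embed` function shown is a toy stand-in for a real sentence encoder.

```python
import numpy as np
from scipy.stats import spearmanr

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def sts_spearman(embed, pairs, gold_scores):
    """Spearman's rank correlation (x100) between cosine similarities of
    sentence embeddings and gold similarity scores, mirroring the STS
    evaluation setup of Reimers and Gurevych (2019)."""
    preds = [cosine_similarity(embed(x1), embed(x2)) for x1, x2 in pairs]
    rho, _ = spearmanr(preds, gold_scores)
    return 100.0 * rho

# Toy character-count "encoder" standing in for a real embedding model:
def embed(sentence):
    vec = np.zeros(26)
    for ch in sentence.lower():
        if ch.isalpha():
            vec[ord(ch) - ord("a")] += 1.0
    return vec
```

Because Spearman's ρ depends only on ranks, any monotone rescaling of the similarity scores leaves the numbers in Table 1 unchanged.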
Table 3: Comparison of similarity scores in STS-🎲-x2 to human judgments for 100 examples. Examples are chosen randomly from the version of STS-🎲-x2 used for training (including label smoothing, augmentation with random pairs and removal of examples where x1 = x2). For column i and row j, the value shown is the percentage of examples generated by DINO for similarity score i that were assigned score j in our human evaluation.

Human labels:
  0.0:  95%  15%   0%   0%
  0.9:   0%   0%  29%  47%
(Only these two rows of Table 3 are recoverable; the column headers, the generated similarity scores, are lost in the extraction.)

Table 4: A selection of high-quality (✓) and low-quality (✗) examples in STS-🎲-x2. Many sentence pairs for y = 1 are not similar and have quite different meanings. Some sentence pairs for y = 0 are not on completely different topics.

y = 1
  ✗  x1 = US closes embassy in Syria / x2 = US Embassy in Syria
  ✗  x1 = A man is playing the cello. / x2 = The cello is playing the man.
  ✗  x1 = A plane is taking off. / x2 = I want to be a pilot.
y = 0.5
  ✓  x1 = A woman is seasoning a piece of meat. / x2 = A man is cooking the meat and adding spices [...]
  ✓  x1 = Second day of Egyptian presidential election / x2 = The first night of the election.
y = 0
  ✓  x1 = A white bus with the word Julia is near water [...] / x2 = There is an open beach in my hometown.
  ✓  x1 = Strong earthquake in Mexico / x2 = It's the best time to get a job
  ✗  x1 = Closed roads in Armenia / x2 = Open roads in Azerbaijan
  ✗  x1 = The man is playing the guitar. / x2 = I'm not a guitar player.
  ✗  x1 = A man is playing a large flute. / x2 = A man is listening to a small flute.

posed to be on completely different topics, many (41%) still have a certain similarity according to human judgment. In contrast, randomly sampled pairs are indeed on completely different topics in almost all cases. Moreover, we can see that GPT2-XL has particular difficulty in generating pairs of non-identical sentences that really mean the same thing: only 47% of all examples that should have the same meaning do actually mean (almost) the same thing. However, the strong performance of S-RoBERTa trained on STS-🎲-x2 suggests that, despite this noise, there is sufficient signal in this dataset for successful training.
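The dataset post-processing mentioned in the Table 3 caption — label smoothing, augmentation with random pairs, and removal of examples where x1 = x2 — can be sketched as below. This is an illustrative reconstruction, not the authors' exact implementation: the function name, the smoothing scheme (scaling labels by 1 − ε so that y = 1 becomes 0.9, matching the scores visible in Table 3), and the random-pair count are all assumptions.

```python
import random

def postprocess(examples, smoothing=0.1, num_random_pairs=100, seed=0):
    """Post-process generated (x1, x2, y) triples:
    1. drop trivial examples where both sentences are identical,
    2. smooth labels by scaling with (1 - smoothing), so y = 1 becomes
       0.9 (an assumed scheme, cf. Szegedy et al., 2016),
    3. add randomly sampled sentence pairs as extra y = 0 examples.
    """
    rng = random.Random(seed)

    # 1) Remove examples where x1 = x2.
    examples = [(x1, x2, y) for x1, x2, y in examples if x1 != x2]

    # 2) Label smoothing.
    examples = [(x1, x2, y * (1.0 - smoothing)) for x1, x2, y in examples]

    # 3) Augmentation with random pairs, assumed to be dissimilar (y = 0).
    sentences = [x1 for x1, _, _ in examples] + [x2 for _, x2, _ in examples]
    for _ in range(num_random_pairs):
        x1, x2 = rng.sample(sentences, 2)
        examples.append((x1, x2, 0.0))
    return examples
```

The random pairs give the trained model explicit evidence for the low end of the similarity scale, which the generator alone produces unreliably, as the human evaluation above shows.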
We finally take a qualitative look at both positive examples where DINO is able to create high-quality text pairs and at some typical errors found in many of the generated examples. As shown in Table 4, for y = 1 the PLM sometimes comes up with decent paraphrases (e.g. "notches a victory" ↦ "wins") or substitutes with very similar meaning ("cutting" ↦ "slicing"), but more often it generates sentences that either omit or mix up important information, and sometimes it produces sentences with an entirely different meaning. Whereas sentences generated for y = 0.5 by and large look reasonable, for y = 0 the PLM often simply flips words ("closed" ↦ "open", "large" ↦ "small") instead of producing sentences on completely different topics.

5 Conclusion

We have introduced DINO, a method for using large PLMs to generate entire datasets of labeled sentence pairs from scratch, requiring no labeled data and no parameter updates. This is achieved by providing instructions in natural language, combined with the self-debiasing method of Schick et al. (2021). With appropriate measures for handling noisy data, models trained on datasets generated with DINO achieve strong results on several semantic textual similarity datasets.

For future work, it would be interesting to see whether the noise in datasets generated with DINO can further be reduced, e.g., by using different sets of instructions (Jiang et al., 2020; Schick and Schütze, 2021a) or by supplementing our pipeline with some additional filtering steps.

Acknowledgments This work was funded by the European Research Council (ERC #740516). We thank the anonymous reviewers for their helpful comments.

References

Eneko Agirre, Carmen Banea, Claire Cardie, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, Weiwei Guo, Iñigo Lopez-Gazpio, Montse Maritxalar, Rada Mihalcea, German Rigau, Larraitz Uria, and Janyce Wiebe. 2015. SemEval-2015 task 2: Semantic textual similarity, English, Spanish and pilot on interpretability. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), pages 252–263, Denver, Colorado. Association for Computational Linguistics.

Eneko Agirre, Carmen Banea, Claire Cardie, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, Weiwei Guo, Rada Mihalcea, German Rigau, and Janyce Wiebe. 2014. SemEval-2014 task 10: Multilingual semantic textual similarity. In Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), pages 81–91, Dublin, Ireland. Association for Computational Linguistics.

Eneko Agirre, Carmen Banea, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, Rada Mihalcea, German Rigau, and Janyce Wiebe. 2016. SemEval-2016 task 1: Semantic textual similarity, monolingual and cross-lingual evaluation. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016), pages 497–511, San Diego, California. Association for Computational Linguistics.

Eneko Agirre, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, and Weiwei Guo. 2013. *SEM 2013 shared task: Semantic textual similarity. In Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity, pages 32–43, Atlanta, Georgia, USA. Association for Computational Linguistics.

Eneko Agirre, Mona Diab, Daniel Cer, and Aitor Gonzalez-Agirre. 2012. SemEval-2012 task 6: A pilot on semantic textual similarity. In Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the Main Conference and the Shared Task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation, SemEval '12, pages 385–393, USA. Association for Computational Linguistics.

Ateret Anaby-Tavor, Boaz Carmeli, Esther Goldbraich, Amir Kantor, George Kour, Segev Shlomov, Naama Tepper, and Naama Zwerdling. 2020. Do not have enough data? Deep learning to the rescue! Proceedings of the AAAI Conference on Artificial Intelligence, 34(05):7383–7390.

Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics, 5:135–146.

Samuel R. Bowman, Gabor Angeli, Christopher Potts, and Christopher D. Manning. 2015. A large annotated corpus for learning natural language inference. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pages 632–642, Lisbon, Portugal. Association for Computational Linguistics.

Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel Ziegler, Jeffrey Wu, Clemens Winter, Chris Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, and Dario Amodei. 2020. Language models are few-shot learners. In Advances in Neural Information Processing Systems, volume 33, pages 1877–1901. Curran Associates, Inc.

Daniel Cer, Mona Diab, Eneko Agirre, Iñigo Lopez-Gazpio, and Lucia Specia. 2017. SemEval-2017 task 1: Semantic textual similarity multilingual and crosslingual focused evaluation. In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pages 1–14, Vancouver, Canada. Association for Computational Linguistics.

Daniel Cer, Yinfei Yang, Sheng-yi Kong, Nan Hua, Nicole Limtiaco, Rhomni St. John, Noah Constant, Mario Guajardo-Cespedes, Steve Yuan, Chris Tar, Brian Strope, and Ray Kurzweil. 2018. Universal sentence encoder for English. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pages 169–174, Brussels, Belgium. Association for Computational Linguistics.

Alexis Conneau, Douwe Kiela, Holger Schwenk, Loïc Barrault, and Antoine Bordes. 2017. Supervised learning of universal sentence representations from natural language inference data. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pages 670–680, Copenhagen, Denmark. Association for Computational Linguistics.

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186, Minneapolis, Minnesota. Association for Computational Linguistics.

Avia Efrat and Omer Levy. 2020. The turking test: Can language models understand instructions? Computing Research Repository, arXiv:2010.11982.

Angela Fan, Mike Lewis, and Yann Dauphin. 2018. Hierarchical neural story generation. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 889–898, Melbourne, Australia. Association for Computational Linguistics.

William Fedus, Barret Zoph, and Noam Shazeer. 2021. Switch transformers: Scaling to trillion parameter models with simple and efficient sparsity. Computing Research Repository, arXiv:2101.03961.

Tianyu Gao, Adam Fisch, and Danqi Chen. 2021. Making pre-trained language models better few-shot learners. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 3816–3830, Online. Association for Computational Linguistics.

John M. Giorgi, Osvald Nitski, Gary D. Bader, and Bo Wang. 2020. DeCLUTR: Deep contrastive learning for unsupervised textual representations. Computing Research Repository, arXiv:2006.03659.

Ari Holtzman, Jan Buys, Li Du, Maxwell Forbes, and Yejin Choi. 2020. The curious case of neural text degeneration. In International Conference on Learning Representations.

Ari Holtzman, Jan Buys, Maxwell Forbes, Antoine Bosselut, David Golub, and Yejin Choi. 2018. Learning to write with cooperative discriminators. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1638–1649, Melbourne, Australia. Association for Computational Linguistics.

Zhengbao Jiang, Frank F. Xu, Jun Araki, and Graham Neubig. 2020. How can we know what language models know? Transactions of the Association for Computational Linguistics, 8:423–438.

Ryan Kiros, Yukun Zhu, Russ R Salakhutdinov, Richard Zemel, Raquel Urtasun, Antonio Torralba, and Sanja Fidler. 2015. Skip-thought vectors. In Advances in Neural Information Processing Systems, volume 28. Curran Associates, Inc.

Varun Kumar, Ashutosh Choudhary, and Eunah Cho. 2021. Data augmentation using pre-trained transformer models. Computing Research Repository, arXiv:2003.02245.

Quoc Le and Tomas Mikolov. 2014. Distributed representations of sentences and documents. In Proceedings of the 31st International Conference on Machine Learning, volume 32 of Proceedings of Machine Learning Research, pages 1188–1196, Bejing, China. PMLR.

Bohan Li, Hao Zhou, Junxian He, Mingxuan Wang, Yiming Yang, and Lei Li. 2020. On the sentence embeddings from pre-trained language models. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 9119–9130, Online. Association for Computational Linguistics.

Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. RoBERTa: A robustly optimized BERT pretraining approach. Computing Research Repository, arXiv:1907.11692.

Marco Marelli, Stefano Menini, Marco Baroni, Luisa Bentivogli, Raffaella Bernardi, and Roberto Zamparelli. 2014. A SICK cure for the evaluation of compositional distributional semantic models. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 216–223, Reykjavik, Iceland. European Language Resources Association (ELRA).

Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. Computing Research Repository, arXiv:1301.3781.

Biswesh Mohapatra, Gaurav Pandey, Danish Contractor, and Sachindra Joshi. 2020. Simulated chats for task-oriented dialog: Learning to generate conversations from instructions. Computing Research Repository, arXiv:2010.10216.

Yannis Papanikolaou and Andrea Pierleoni. 2020. DARE: Data augmented relation extraction with GPT-2. Computing Research Repository, arXiv:2004.13845.

Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, and Adam Lerer. 2017. Automatic differentiation in PyTorch. In NIPS Autodiff Workshop.

Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. GloVe: Global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 1532–1543, Doha, Qatar. Association for Computational Linguistics.

Matthew Peters, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark, Kenton Lee, and Luke Zettlemoyer. 2018. Deep contextualized word representations. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 2227–2237, New Orleans, Louisiana. Association for Computational Linguistics.

Nina Pörner and Hinrich Schütze. 2019. Multi-view domain adapted sentence embeddings for low-resource unsupervised duplicate question detection. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019, Hong Kong, China, November 3-7, 2019, pages 1630–1641. Association for Computational Linguistics.

Nina Pörner, Ulli Waltinger, and Hinrich Schütze. 2020. Sentence meta-embeddings for unsupervised semantic textual similarity. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5-10, 2020, pages 7027–7034. Association for Computational Linguistics.

Raul Puri and Bryan Catanzaro. 2019. Zero-shot text classification with generative language models. Computing Research Repository, arXiv:1912.10165.

Alec Radford, Karthik Narasimhan, Tim Salimans, and Ilya Sutskever. 2018. Improving language understanding by generative pre-training.

Alec Radford, Jeff Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever. 2019. Language models are unsupervised multitask learners. Technical report.

Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. 2020. Exploring the limits of transfer learning with a unified text-to-text transformer. Journal of Machine Learning Research, 21(140):1–67.

Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence embeddings using Siamese BERT-networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 3982–3992, Hong Kong, China. Association for Computational Linguistics.

Timo Schick and Hinrich Schütze. 2020. Few-shot text generation with pattern-exploiting training. Computing Research Repository, arXiv:2012.11926.

Timo Schick and Hinrich Schütze. 2021a. Exploiting cloze questions for few shot text classification and natural language inference. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, Kyiv, Ukraine (Online). International Committee on Computational Linguistics.

Timo Schick and Hinrich Schütze. 2021b. It's not just size that matters: Small language models are also few-shot learners. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 2339–2352, Online. Association for Computational Linguistics.

Timo Schick, Sahana Udupa, and Hinrich Schütze. 2021. Self-diagnosis and self-debiasing: A proposal for reducing corpus-based bias in NLP. Transactions of the Association for Computational Linguistics.

C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna. 2016. Rethinking the inception architecture for computer vision. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 2818–2826.

Orion Weller, Nicholas Lourie, Matt Gardner, and Matthew Peters. 2020. Learning from task descriptions. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP).

John Wieting and Kevin Gimpel. 2018. ParaNMT-50M: Pushing the limits of paraphrastic sentence embeddings with millions of machine translations. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 451–462, Melbourne, Australia. Association for Computational Linguistics.

Adina Williams, Nikita Nangia, and Samuel Bowman. 2018. A broad-coverage challenge corpus for sentence understanding through inference. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 1112–1122. Association for Computational Linguistics.

Thomas Wolf, Lysandre Debut, Victor Sanh, Julien Chaumond, Clement Delangue, Anthony Moi, Pierric Cistac, Tim Rault, Remi Louf, Morgan Funtowicz, Joe Davison, Sam Shleifer, Patrick von Platen, Clara Ma, Yacine Jernite, Julien Plu, Canwen Xu, Teven Le Scao, Sylvain Gugger, Mariama Drame, Quentin Lhoest, and Alexander Rush. 2020. Transformers: State-of-the-art natural language processing. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pages 38–45, Online. Association for Computational Linguistics.

Zhuofeng Wu, Sinong Wang, Jiatao Gu, Madian Khabsa, Fei Sun, and Hao Ma. 2020. CLEAR: Contrastive learning for sentence representation. Computing Research Repository, arXiv:2012.15466.

Yiben Yang, Chaitanya Malaviya, Jared Fernandez, Swabha Swayamdipta, Ronan Le Bras, Ji-Ping Wang, Chandra Bhagavatula, Yejin Choi, and Doug Downey. 2020. Generative data augmentation for commonsense reasoning. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 1008–1025, Online. Association for Computational Linguistics.

Yan Zhang, Ruidan He, Zuozhu Liu, Kwan Hui Lim, and Lidong Bing. 2020. An unsupervised sentence embedding method by mutual information maximization. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 1601–1610, Online. Association for Computational Linguistics.
C Additional Results

Our main results do not include scores for DeCLUTR (Giorgi et al., 2020) and CLEAR (Wu et al., 2020) – two recent approaches using contrastive learning – as their evaluation setup differs from that described in Reimers and Gurevych (2019) (and used by all other baselines) in the following respects: