I notice that the Crapwatch uses a 2-level hierarchy, rather than a 3-level hierarchy as above. The 3-level hierarchy makes it clear whether something is a typo or a direct match to an entry in WP:CRAPWATCH/SETUP.
For instance
Rank | Target/Group | Entries (Citations, Articles) | Total Citations | Distinct Articles | Citations/article |
---|---|---|---|---|---|
148 | Blaze Media [WP:RSP § Generally unreliable] WP:RSP#Blaze Media | | 11 | 7 | 1.571 |
would be much better understood as
Rank | Target/Group | Entries (Citations, Articles) | Total Citations | Distinct Articles | Citations/article |
---|---|---|---|---|---|
148 | Blaze Media [WP:RSP § Generally unreliable] WP:RSP#Blaze Media | | 11 | 7 | 1.571 |
since Blaze Magazine is a typo/variant of The Blaze (magazine).
Likewise with Hindawi, you have
Rank | Target/Group | Entries (Citations, Articles) | Total Citations | Distinct Articles | Citations/article |
---|---|---|---|---|---|
4 | Hindawi Publishing Corporation [Beall's publisher list*] Originally listed on Beall's list, but later removed as a 'borderline case' | ... ... | 2478 | 2097 | 1.182 |
It would be a lot clearer (to humans) why Int J Inflam was listed if it were under
Rank | Target/Group | Entries (Citations, Articles) | Total Citations | Distinct Articles | Citations/article |
---|---|---|---|---|---|
4 | Hindawi Publishing Corporation [Beall's publisher list*] Originally listed on Beall's list, but later removed as a 'borderline case' | ... ... | 2478 | 2097 | 1.182 |
Int J Inflam would still be only counted once in the statistics, even if it was listed twice. Headbomb {t · c · p · b} 09:50, 18 March 2019 (UTC)
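The "counted once even if listed twice" behaviour could be sketched as follows. This is only an illustration in Python, not JL-Bot's actual code; the function name `total_citations`, the sub-heading labels, and all the numbers are hypothetical.

```python
def total_citations(sections):
    """Sum citations for a group, counting each entry only once.

    sections maps a sub-heading (hypothetical labels) to a list of
    (entry_name, citation_count) pairs. An entry may appear under more
    than one sub-heading; it contributes to the total a single time.
    """
    seen = {}
    for entries in sections.values():
        for name, citations in entries:
            # Keep the first count seen for an entry; later repeats are ignored.
            seen.setdefault(name, citations)
    return sum(seen.values())


# Hypothetical data: "Int J Inflam" is listed under two sub-headings.
hindawi = {
    "publisher match": [("Int J Inflam", 5), ("Some Other Journal", 3)],
    "journal match": [("Int J Inflam", 5)],
}
print(total_citations(hindawi))
```

With the sample above, the repeated entry adds 5 to the total once, not twice.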
I just realized that I don't think I remember you uploading the JL-Bot code publicly? It's a fairly advanced piece of software now, and I'm starting to get worried about the bus factor here. Would you be willing to put the code up somewhere (possibly linked in an {{infobox bot}} on the bot's userpage)? Headbomb {t · c · p · b} 21:18, 12 November 2019 (UTC)
It would be useful if we could exclude bluelinks/redlinks from matching with {{JCW-pattern}}. For example,
{{JCW-pattern|Online|*Online*|!Nonlinear!|exclude=bluelinks}}
would only match redlinks. This would be useful in the case of something like
which would exclude the first four entries, but not the last one. Conversely,
{{JCW-pattern|Online|*Online*|!Nonlinear!|exclude=redlinks}}
would only match bluelinks, and in this case, keep the first four entries, but exclude the last one. Headbomb {t · c · p · b} 12:50, 19 November 2019 (UTC)
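The proposed |exclude= behaviour could be sketched like this. It is a minimal Python illustration under stated assumptions, not how JL-Bot implements anything: the function name `filter_matches`, the `(title, exists)` pair shape, and the sample titles are all hypothetical.

```python
def filter_matches(entries, exclude=None):
    """Filter pattern matches by link status.

    entries: list of (title, exists) pairs, where exists is True for a
             bluelink (the page exists) and False for a redlink.
    exclude: None, "bluelinks", or "redlinks", mirroring the proposed
             |exclude= parameter (hypothetical values).
    """
    if exclude is None:
        return list(entries)
    if exclude == "bluelinks":
        # Drop existing pages, keep only redlinks.
        return [(title, exists) for title, exists in entries if not exists]
    if exclude == "redlinks":
        # Drop missing pages, keep only bluelinks.
        return [(title, exists) for title, exists in entries if exists]
    raise ValueError(f"unknown exclude value: {exclude!r}")


# Hypothetical matches: four bluelinks followed by one redlink.
sample = [
    ("Example Journal Online", True),
    ("Online Example Review", True),
    ("Example Online Letters", True),
    ("Online Journal of Examples", True),
    ("Example Bulletin Online", False),
]

only_redlinks = filter_matches(sample, exclude="bluelinks")
only_bluelinks = filter_matches(sample, exclude="redlinks")
```

Here `exclude="bluelinks"` drops the first four entries and keeps the last, while `exclude="redlinks"` does the opposite.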
What's this new section? What is its purpose / How does it work? Headbomb {t · c · p · b} 10:45, 24 November 2019 (UTC)
In WP:JCW/Publisher5#Mary Ann Liebert you have
and then later
The second entry comes from a {{doi-inline}} template, and isn't properly merged into the main grouping. Headbomb {t · c · p · b} 12:15, 18 December 2019 (UTC)
It would be a good idea to do runs whenever Category:Redirects from DOI prefixes has new or different members in it. I don't believe anything would change except for |registrant= in the compilation, so maybe a separate subroutine to just sync |registrant= with the category would be enough. Headbomb {t · c · p · b} 15:44, 10 January 2020 (UTC)
@JLaTondre: I think the bot choked last night. Headbomb {t · c · p · b} 11:49, 18 January 2020 (UTC)
Also User:JL-Bot/DOI could be updated with every dump (with the new template-based format). Headbomb {t · c · p · b} 09:48, 16 February 2020 (UTC)
@JLaTondre: very useful. I've removed the 37000s to get a more representative sense of what a typical delta would be. Whatever frequency we settle on for the JL-Bot/DOI updates, uploading a delta automatically would be very useful. Headbomb {t · c · p · b} 00:17, 27 February 2020 (UTC)
Now that we have a substantial number of DOIs, it would be good if the bot automatically 'selected' publishers and journals based on Category:Redirects from DOI prefixes.
For example, 10.1068 has this
#REDIRECT[[SAGE Publishing]] {{R from DOI prefix|registrant=Pion Ltd}}
For SAGE Publishing, this would basically be every redirect in Redirects from DOI prefixes that points to SAGE Publishing (with each |registrant= found in those redirects listed as |imprint#=) or which has |registrant=SAGE Publishing (in this case nothing)
{{JCW-selected |SAGE Publishing |imprint1=Pion Ltd |doi1=10.1068 |doi2=10.1106 |doi3=10.1177 |doi4=10.1191 |doi5=10.1243 |doi6=10.1258 |doi7=10.1345 |doi8=10.1354 |doi9=10.1369 |doi10=10.1622 |doi11=10.1630 |doi12=10.2182 |doi13=10.2189 |doi14=10.2511 |doi15=10.2968 |doi16=10.3317 |doi17=10.3821 |doi18=10.4135 |doi19=10.4137 |doi20=10.4219 |doi21=10.5034 |doi22=10.5126 |doi23=10.5193 |doi24=10.5301 |doi25=10.5367 |doi26=10.7182 |doi27=10.17322 |doi28=10.31124}}
For Pion Ltd, this would basically be everything that points to Pion Ltd (in this case nothing, since it redirects to SAGE Publishing) or has |registrant=Pion Ltd (10.1068)
{{JCW-selected|Pion Ltd|parent1=SAGE Publishing|doi1=10.1068}}
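The selection rule described above could be sketched roughly as follows. This is a Python illustration under stated assumptions, not JL-Bot's code: the function name `jcw_selected` and the `(prefix, target, registrant)` tuple shape are hypothetical, and the sample uses only two redirects, with `None` standing in for "no |registrant= recorded" on 10.1177 (an assumption made for the example).

```python
def jcw_selected(name, redirects):
    """Build a {{JCW-selected}} call for `name` from DOI-prefix redirects.

    redirects: list of (prefix, target, registrant) tuples, one per page in
    Category:Redirects from DOI prefixes (hypothetical data shape).
    """
    dois, imprints, parents = [], [], []
    for prefix, target, registrant in redirects:
        if target == name:
            # Redirect points at this publisher: take the prefix, and
            # record a differing registrant as an imprint.
            dois.append(prefix)
            if registrant and registrant != name and registrant not in imprints:
                imprints.append(registrant)
        elif registrant == name:
            # Registrant matches but the redirect points elsewhere:
            # take the prefix and treat the target as a parent.
            dois.append(prefix)
            if target not in parents:
                parents.append(target)
    # Order prefixes numerically by registrant code (10.7182 before 10.17322).
    dois.sort(key=lambda p: int(p.split(".")[1]))
    parts = [name]
    parts += [f"parent{i}={p}" for i, p in enumerate(parents, 1)]
    parts += [f"imprint{i}={m}" for i, m in enumerate(imprints, 1)]
    parts += [f"doi{i}={d}" for i, d in enumerate(dois, 1)]
    return "{{JCW-selected|" + "|".join(parts) + "}}"


# Two illustrative redirects; the registrant of 10.1177 is assumed absent.
redirects = [
    ("10.1177", "SAGE Publishing", None),
    ("10.1068", "SAGE Publishing", "Pion Ltd"),
]
print(jcw_selected("Pion Ltd", redirects))
print(jcw_selected("SAGE Publishing", redirects))
```

Treating the redirect target as a parent whenever only |registrant= matches is one reading of the proposal; the bot could equally resolve parents some other way.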