<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>http://wiki.docking.org/index.php?action=history&amp;feed=atom&amp;title=Sun_Grid_Engine_%28SGE%29</id>
	<title>Sun Grid Engine (SGE) - Revision history</title>
	<link rel="self" type="application/atom+xml" href="http://wiki.docking.org/index.php?action=history&amp;feed=atom&amp;title=Sun_Grid_Engine_%28SGE%29"/>
	<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Sun_Grid_Engine_(SGE)&amp;action=history"/>
	<updated>2026-04-09T19:39:48Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.39.1</generator>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Sun_Grid_Engine_(SGE)&amp;diff=9867&amp;oldid=prev</id>
		<title>Benrwong at 20:56, 23 January 2017</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Sun_Grid_Engine_(SGE)&amp;diff=9867&amp;oldid=prev"/>
		<updated>2017-01-23T20:56:51Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 20:56, 23 January 2017&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l444&quot;&gt;Line 444:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 444:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;To view jobs running on host queue:&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;To view jobs running on host queue:&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;  qhost -h &amp;lt;hostname&amp;gt; -j  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;  qhost -h &amp;lt;hostname&amp;gt; -j  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;==External Links==&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Add/Remove Administrative, Execution, Submit Hosts: http://gridscheduler.sourceforge.net/howto/commontasks.html&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[Category: Sysadmin]]&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[Category: Sysadmin]]&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key wikidb:diff::1.12:old-9606:rev-9867 --&gt;
&lt;/table&gt;</summary>
		<author><name>Benrwong</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Sun_Grid_Engine_(SGE)&amp;diff=9606&amp;oldid=prev</id>
		<title>Benrwong at 18:03, 20 September 2016</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Sun_Grid_Engine_(SGE)&amp;diff=9606&amp;oldid=prev"/>
		<updated>2016-09-20T18:03:29Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 18:03, 20 September 2016&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l438&quot;&gt;Line 438:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 438:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;  Jan 20 14:37:06 pan puppet-agent[2021]: Could not request certificate: Connection refused - connect(2)&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;  Jan 20 14:37:06 pan puppet-agent[2021]: Could not request certificate: Connection refused - connect(2)&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;This was what happened when I restarted the machine.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;This was what happened when I restarted the machine.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;==Sun Grid Engine Commands==&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;To disable a host from queue:&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; qmod -d &#039;*@&amp;lt;hostname&gt;&#039;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;To view jobs running on host queue:&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; qhost -h &amp;lt;hostname&gt; -j &lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[Category: Sysadmin]]&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[Category: Sysadmin]]&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key wikidb:diff::1.12:old-9441:rev-9606 --&gt;
&lt;/table&gt;</summary>
		<author><name>Benrwong</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Sun_Grid_Engine_(SGE)&amp;diff=9441&amp;oldid=prev</id>
		<title>Benrwong at 18:58, 28 June 2016</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Sun_Grid_Engine_(SGE)&amp;diff=9441&amp;oldid=prev"/>
		<updated>2016-06-28T18:58:54Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 18:58, 28 June 2016&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l438&quot;&gt;Line 438:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 438:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;  Jan 20 14:37:06 pan puppet-agent[2021]: Could not request certificate: Connection refused - connect(2)&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;  Jan 20 14:37:06 pan puppet-agent[2021]: Could not request certificate: Connection refused - connect(2)&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;This was what happened when I restarted the machine.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;This was what happened when I restarted the machine.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;[[Category: Sysadmin]]&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key wikidb:diff::1.12:old-9439:rev-9441 --&gt;
&lt;/table&gt;</summary>
		<author><name>Benrwong</name></author>
	</entry>
	<entry>
		<id>http://wiki.docking.org/index.php?title=Sun_Grid_Engine_(SGE)&amp;diff=9439&amp;oldid=prev</id>
		<title>Benrwong: Creating page based on &quot;ALL ABOUT SGE (SUN GRID ENGINE)&quot; from Lab Manual</title>
		<link rel="alternate" type="text/html" href="http://wiki.docking.org/index.php?title=Sun_Grid_Engine_(SGE)&amp;diff=9439&amp;oldid=prev"/>
		<updated>2016-06-28T18:56:08Z</updated>

		<summary type="html">&lt;p&gt;Creating page based on &amp;quot;ALL ABOUT SGE (SUN GRID ENGINE)&amp;quot; from Lab Manual&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;ALL ABOUT SGE (SUN GRID ENGINE)&lt;br /&gt;
&lt;br /&gt;
To add an exec node:&lt;br /&gt;
  yum -y install gridengine gridengine-execd&lt;br /&gt;
  export SGE_ROOT=/usr/share/gridengine&lt;br /&gt;
  export SGE_CELL=bkslab&lt;br /&gt;
  cp -v /nfs/init/gridengine/install.conf /tmp/gridengine-install.conf&lt;br /&gt;
 +++++++++++++++++++++++++++++++++++++++++++++++++++++&lt;br /&gt;
 #-------------------------------------------------&lt;br /&gt;
 # SGE default configuration file&lt;br /&gt;
 #------------------------------------------------- &lt;br /&gt;
 # Use always fully qualified pathnames, please &amp;lt;br /&amp;gt;&lt;br /&gt;
 # SGE_ROOT Path, this is basic information&lt;br /&gt;
 #(mandatory for qmaster and execd installation)&lt;br /&gt;
 SGE_ROOT=&amp;quot;/usr/share/gridengine&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # SGE_QMASTER_PORT is used by qmaster for communication&lt;br /&gt;
 # Please enter the port in this way: 1300&lt;br /&gt;
 # Please do not this: 1300/tcp&lt;br /&gt;
 #(mandatory for qmaster installation)&lt;br /&gt;
 SGE_QMASTER_PORT=6444 &amp;lt;br /&amp;gt;&lt;br /&gt;
 # SGE_EXECD_PORT is used by execd for communication&lt;br /&gt;
 # Please enter the port in this way: 1300&lt;br /&gt;
 # Please do not this: 1300/tcp&lt;br /&gt;
 #(mandatory for qmaster installation)&lt;br /&gt;
 SGE_EXECD_PORT=6445 &amp;lt;br /&amp;gt;&lt;br /&gt;
 # SGE_ENABLE_SMF&lt;br /&gt;
 # if set to false SMF will not control SGE services&lt;br /&gt;
 SGE_ENABLE_SMF=&amp;quot;false&amp;quot;&amp;lt;br /&amp;gt;&lt;br /&gt;
 # SGE_ENABLE_ST&lt;br /&gt;
 # if set to false Sun Service Tags will not be used&lt;br /&gt;
 SGE_ENABLE_ST=&amp;quot;true&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # SGE_CLUSTER_NAME&lt;br /&gt;
 # Name of this cluster (used by SMF as an service instance name)&lt;br /&gt;
 SGE_CLUSTER_NAME=&amp;quot;bkslab&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # SGE_JMX_PORT is used by qmasters JMX MBean server&lt;br /&gt;
 # mandatory if install_qmaster -jmx -auto &amp;lt;cfgfile&amp;gt;&lt;br /&gt;
 # range: 1024-65500 &lt;br /&gt;
 SGE_JMX_PORT=&amp;quot;6446&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # SGE_JMX_SSL is used by qmasters JMX MBean server&lt;br /&gt;
 # if SGE_JMX_SSL=true, the mbean server connection uses&lt;br /&gt;
 # SSL authentication&lt;br /&gt;
 SGE_JMX_SSL=&amp;quot;true&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # SGE_JMX_SSL_CLIENT is used by qmasters JMX MBean server&lt;br /&gt;
 # if SGE_JMX_SSL_CLIENT=true, the mbean server connection uses&lt;br /&gt;
 # SSL authentication of the client in addition&lt;br /&gt;
 SGE_JMX_SSL_CLIENT=&amp;quot;true&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # SGE_JMX_SSL_KEYSTORE is used by qmasters JMX MBean server&lt;br /&gt;
 # if SGE_JMX_SSL=true the server keystore found here is used&lt;br /&gt;
 # e.g. /var/sgeCA/port&amp;lt;sge_qmaster_port&amp;gt;/&amp;lt;sge_cell&amp;gt;/private/keystore&lt;br /&gt;
 SGE_JMX_SSL_KEYSTORE=&amp;quot;/var/sgeCA/sge_qmaster/bkslab/private/keystore&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # SGE_JMX_SSL_KEYSTORE_PW is used by qmasters JMX MBean server&lt;br /&gt;
 # password for the SGE_JMX_SSL_KEYSTORE file&lt;br /&gt;
 SGE_JMX_SSL_KEYSTORE_PW=&amp;quot;secret&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # SGE_JVM_LIB_PATH is used by qmasters jvm thread&lt;br /&gt;
 # path to libjvm.so&lt;br /&gt;
 # if value is missing or set to &amp;quot;none&amp;quot; JMX thread will not be installed&lt;br /&gt;
 # when the value is empty or path does not exit on the system, Grid Engine &lt;br /&gt;
 # will try to find a correct value, if it cannot do so, value is set to &lt;br /&gt;
 # &amp;quot;jvmlib_missing&amp;quot; and JMX thread will be configured but will fail to start&lt;br /&gt;
 SGE_JVM_LIB_PATH=&amp;quot;none&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # SGE_ADDITIONAL_JVM_ARGS is used by qmasters jvm thread &lt;br /&gt;
 # jvm specific arguments as -verbose:jni etc.&lt;br /&gt;
 # optional, can be empty&lt;br /&gt;
 SGE_ADDITIONAL_JVM_ARGS=&amp;quot;-Xmx256m&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # CELL_NAME, will be a dir in SGE_ROOT, contains the common dir&lt;br /&gt;
 # Please enter only the name of the cell. No path, please&lt;br /&gt;
 #(mandatory for qmaster and execd installation)&lt;br /&gt;
 CELL_NAME=&amp;quot;bkslab&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # ADMIN_USER, if you want to use a different admin user than the owner,&lt;br /&gt;
 # of SGE_ROOT, you have to enter the user name, here&lt;br /&gt;
 # Leaving this blank, the owner of the SGE_ROOT dir will be used as admin user&lt;br /&gt;
 ADMIN_USER=&amp;quot;&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # The dir, where qmaster spools this parts, which are not spooled by DB&lt;br /&gt;
 #(mandatory for qmaster installation)&lt;br /&gt;
 QMASTER_SPOOL_DIR=&amp;quot;/var/spool/gridengine/bkslab/qmaster&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # The dir, where the execd spools (active jobs)&lt;br /&gt;
 # This entry is needed, even if your are going to use&lt;br /&gt;
 # berkeley db spooling. Only cluster configuration and jobs will&lt;br /&gt;
 # be spooled in the database. The execution daemon still needs a spool&lt;br /&gt;
 # directory  &lt;br /&gt;
 #(mandatory for qmaster installation)&lt;br /&gt;
 EXECD_SPOOL_DIR=&amp;quot;/var/spool/gridengine&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # For monitoring and accounting of jobs, every job will get&lt;br /&gt;
 # unique GID. So you have to enter a free GID Range, which&lt;br /&gt;
 # is assigned to each job running on a machine.&lt;br /&gt;
 # If you want to run 100 Jobs at the same time on one host you&lt;br /&gt;
 # have to enter a GID-Range like that: 16000-16100&lt;br /&gt;
 #(mandatory for qmaster installation)&lt;br /&gt;
 GID_RANGE=&amp;quot;16000-16100&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # If SGE is compiled with -spool-dynamic, you have to enter here, which&lt;br /&gt;
 # spooling method should be used. (classic or berkeleydb)&lt;br /&gt;
 #(mandatory for qmaster installation)&lt;br /&gt;
 SPOOLING_METHOD=&amp;quot;berkeleydb&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # Name of the Server, where the Spooling DB is running on&lt;br /&gt;
 # if spooling methode is berkeleydb, it must be &amp;quot;none&amp;quot;, when&lt;br /&gt;
 # using no spooling server and it must contain the servername&lt;br /&gt;
 # if a server should be used. In case of &amp;quot;classic&amp;quot; spooling,&lt;br /&gt;
 # can be left out&lt;br /&gt;
 DB_SPOOLING_SERVER=&amp;quot;none&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # The dir, where the DB spools&lt;br /&gt;
 # If berkeley db spooling is used, it must contain the path to&lt;br /&gt;
 # the spooling db. Please enter the full path. (eg. /tmp/data/spooldb)&lt;br /&gt;
 # Remember, this directory must be local on the qmaster host or on the&lt;br /&gt;
 # Berkeley DB Server host. No NFS mount, please&lt;br /&gt;
 DB_SPOOLING_DIR=&amp;quot;/var/spool/gridengine/bkslab/spooldb&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # This parameter set the number of parallel installation processes.&lt;br /&gt;
 # The prevent a system overload, or exeeding the number of open file&lt;br /&gt;
 # descriptors the user can limit the number of parallel install processes.&lt;br /&gt;
 # eg. set PAR_EXECD_INST_COUNT=&amp;quot;20&amp;quot;, maximum 20 parallel execd are installed.&lt;br /&gt;
 PAR_EXECD_INST_COUNT=&amp;quot;20&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # A List of Host which should become admin hosts&lt;br /&gt;
 # If you do not enter any host here, you have to add all of your hosts&lt;br /&gt;
 # by hand, after the installation. The autoinstallation works without&lt;br /&gt;
 # any entry&lt;br /&gt;
 ADMIN_HOST_LIST=&amp;quot;&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # A List of Host which should become submit hosts&lt;br /&gt;
 # If you do not enter any host here, you have to add all of your hosts&lt;br /&gt;
 # by hand, after the installation. The autoinstallation works without&lt;br /&gt;
 # any entry&lt;br /&gt;
 SUBMIT_HOST_LIST=&amp;quot;&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # A List of Host which should become exec hosts&lt;br /&gt;
 # If you do not enter any host here, you have to add all of your hosts&lt;br /&gt;
 # by hand, after the installation. The autoinstallation works without&lt;br /&gt;
 # any entry&lt;br /&gt;
 # (mandatory for execution host installation)&lt;br /&gt;
 EXEC_HOST_LIST=&amp;quot;&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # The dir, where the execd spools (local configuration)&lt;br /&gt;
 # If you want configure your execution daemons to spool in&lt;br /&gt;
 # a local directory, you have to enter this directory here.&lt;br /&gt;
 # If you do not want to configure a local execution host spool directory&lt;br /&gt;
 # please leave this empty&lt;br /&gt;
 EXECD_SPOOL_DIR_LOCAL=&amp;quot;/var/spool/gridengine&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # If true, the domainnames will be ignored, during the hostname resolving&lt;br /&gt;
 # if false, the fully qualified domain name will be used for name resolving&lt;br /&gt;
 HOSTNAME_RESOLVING=&amp;quot;false&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # Shell, which should be used for remote installation (rsh/ssh)&lt;br /&gt;
 # This is only supported, if your hosts and rshd/sshd is configured,&lt;br /&gt;
 # not to ask for a password, or promting any message.&lt;br /&gt;
 SHELL_NAME=&amp;quot;ssh&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # This remote copy command is used for csp installation.&lt;br /&gt;
 # The script needs the remote copy command for distributing&lt;br /&gt;
 # the csp certificates. Using ssl the command scp has to be entered,&lt;br /&gt;
 # using  the not so secure rsh the command rcp has to be entered.&lt;br /&gt;
 # Both need a passwordless ssh/rsh connection to the hosts, which&lt;br /&gt;
 # should be connected to. (mandatory for csp installation mode)&lt;br /&gt;
 COPY_COMMAND=&amp;quot;scp&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # Enter your default domain, if you are using /etc/hosts or NIS configuration&lt;br /&gt;
 DEFAULT_DOMAIN=&amp;quot;none&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # If a job stops, fails, finish, you can send a mail to this adress&lt;br /&gt;
 ADMIN_MAIL=&amp;quot;none&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # If true, the rc scripts (sgemaster, sgeexecd, sgebdb) will be added,&lt;br /&gt;
 # to start automatically during boottime&lt;br /&gt;
 ADD_TO_RC=&amp;quot;true&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 #If this is &amp;quot;true&amp;quot; the file permissions of executables will be set to 755&lt;br /&gt;
 #and of ordenary file to 644.  &lt;br /&gt;
 SET_FILE_PERMS=&amp;quot;true&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # This option is not implemented, yet.&lt;br /&gt;
 # When a exechost should be uninstalled, the running jobs will be rescheduled&lt;br /&gt;
 RESCHEDULE_JOBS=&amp;quot;wait&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # Enter a one of the three distributed scheduler tuning configuration sets&lt;br /&gt;
 # (1=normal, 2=high, 3=max)&lt;br /&gt;
 SCHEDD_CONF=&amp;quot;1&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # The name of the shadow host. This host must have read/write permission&lt;br /&gt;
 # to the qmaster spool directory&lt;br /&gt;
 # If you want to setup a shadow host, you must enter the servername&lt;br /&gt;
 # (mandatory for shadowhost installation)&lt;br /&gt;
 SHADOW_HOST=&amp;quot;&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # Remove this execution hosts in automatic mode&lt;br /&gt;
 # (mandatory for unistallation of execution hosts)&lt;br /&gt;
 EXEC_HOST_LIST_RM=&amp;quot;&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # This option is used for startup script removing. &lt;br /&gt;
 # If true, all rc startup scripts will be removed during&lt;br /&gt;
 # automatic deinstallation. If false, the scripts won&amp;#039;t&lt;br /&gt;
 # be touched.&lt;br /&gt;
 # (mandatory for unistallation of execution/qmaster hosts)&lt;br /&gt;
 REMOVE_RC=&amp;quot;true&amp;quot; &amp;lt;br /&amp;gt; &lt;br /&gt;
 # This is a Windows specific part of the auto isntallation template&lt;br /&gt;
 # If you going to install windows executions hosts, you have to enable the&lt;br /&gt;
 # windows support. To do this, please set the WINDOWS_SUPPORT variable&lt;br /&gt;
 # to &amp;quot;true&amp;quot;. (&amp;quot;false&amp;quot; is disabled)&lt;br /&gt;
 # (mandatory for qmaster installation, by default WINDOWS_SUPPORT is&lt;br /&gt;
 # disabled)&lt;br /&gt;
 WINDOWS_SUPPORT=&amp;quot;false&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # Enabling the WINDOWS_SUPPORT, recommends the following parameter.&lt;br /&gt;
 # The WIN_ADMIN_NAME will be added to the list of SGE managers.&lt;br /&gt;
 # Without adding the WIN_ADMIN_NAME the execution host installation&lt;br /&gt;
 # won&amp;#039;t install correctly.&lt;br /&gt;
 # WIN_ADMIN_NAME is set to &amp;quot;Administrator&amp;quot; which is default on most&lt;br /&gt;
 # Windows systems. In some cases the WIN_ADMIN_NAME can be prefixed with&lt;br /&gt;
 # the windows domain name (eg. DOMAIN+Administrator)&lt;br /&gt;
 # (mandatory for qmaster installation, if windows hosts should be installed)&lt;br /&gt;
 WIN_ADMIN_NAME=&amp;quot;Administrator&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # This parameter is used to switch between local ADMINUSER and Windows&lt;br /&gt;
 # Domain Adminuser. Setting the WIN_DOMAIN_ACCESS variable to true, the&lt;br /&gt;
 # Adminuser will be a Windows Domain User. It is recommended that &lt;br /&gt;
 # a Windows Domain Server is configured and the Windows Domain User is&lt;br /&gt;
 # created. Setting this variable to false, the local Adminuser will be&lt;br /&gt;
 # used as ADMINUSER. The install script tries to create this user account&lt;br /&gt;
 # but we recommend, because it will be saver, to create this user, &lt;br /&gt;
 # before running the installation. &lt;br /&gt;
 # (mandatory for qmaster installation, if windows hosts should be installed)&lt;br /&gt;
 WIN_DOMAIN_ACCESS=&amp;quot;false&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # This section is used for csp installation mode.&lt;br /&gt;
 # CSP_RECREATE recreates the certs on each installtion, if true.&lt;br /&gt;
 # In case of false, the certs will be created, if not existing.&lt;br /&gt;
 # Existing certs won&amp;#039;t be overwritten. (mandatory for csp install)&lt;br /&gt;
 CSP_RECREATE=&amp;quot;true&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # The created certs won&amp;#039;t be copied, if this option is set to false&lt;br /&gt;
 # If true, the script tries to copy the generated certs. This&lt;br /&gt;
 # requires passwordless ssh/rsh access for user root to the&lt;br /&gt;
 # execution hosts&lt;br /&gt;
 CSP_COPY_CERTS=&amp;quot;false&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # csp information, your country code (only 2 characters)&lt;br /&gt;
 # (mandatory for csp install)&lt;br /&gt;
 CSP_COUNTRY_CODE=&amp;quot;CA&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # your state (mandatory for csp install)&lt;br /&gt;
 CSP_STATE=&amp;quot;Ontario&amp;quot; &amp;lt;br /&amp;gt; &lt;br /&gt;
 # your location, eg. the building (mandatory for csp install)&lt;br /&gt;
 CSP_LOCATION=&amp;quot;Faculty of Pharmacy&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # your arganisation (mandatory for csp install)&lt;br /&gt;
 CSP_ORGA=&amp;quot;University of Toronto&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # your organisation unit (mandatory for csp install)&lt;br /&gt;
 CSP_ORGA_UNIT=&amp;quot;Shoichet Lab&amp;quot; &amp;lt;br /&amp;gt;&lt;br /&gt;
 # your email (mandatory for csp install)&lt;br /&gt;
 CSP_MAIL_ADDRESS=&amp;quot;admin@bkslab.org&amp;quot; &amp;lt;br /&amp;gt;                                         &lt;br /&gt;
 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++&lt;br /&gt;
  vim /tmp/gridengine-install.conf   -&amp;gt; CHANGE EXEC_HOST_LIST=&amp;quot; &amp;quot; TO EXEC_HOST_LIST=&amp;quot;$HOSTNAME&amp;quot;&lt;br /&gt;
  cd /usr/share/gridengine/&lt;br /&gt;
  ./inst_sge -x -s -auto /tmp/gridengine-install.conf &amp;gt; /tmp/gridengine.log&lt;br /&gt;
  cat /tmp/gridengine.log | tee -a /root/gridengine-install.log&lt;br /&gt;
  if [ -e ${SGE_CELL} ]; then         mv -v ${SGE_CELL} ${SGE_CELL}.local; fi&lt;br /&gt;
  ln -vs /nfs/gridengine/${SGE_CELL} /usr/share/gridengine/${SGE_CELL}&lt;br /&gt;
  rm -vf /etc/sysconfig/gridengine&lt;br /&gt;
  echo &amp;quot;SGE_ROOT=${SGE_ROOT}&amp;quot; &amp;gt;&amp;gt; /etc/sysconfig/gridengine&lt;br /&gt;
  echo &amp;quot;SGE_CELL=${SGE_CELL}&amp;quot; &amp;gt;&amp;gt; /etc/sysconfig/gridengine&lt;br /&gt;
  mkdir -pv /var/spool/gridengine/`hostname -s`&lt;br /&gt;
  chown -Rv sgeadmin:sgeadmin /var/spool/gridengine&lt;br /&gt;
  chkconfig --levels=345 sge_execd on &amp;lt;br /&amp;gt;&lt;br /&gt;
  Go to sgemaster and do this:&lt;br /&gt;
  qconf -ae --&amp;gt; CHANGE THE HOSTNAME FROM &amp;quot;template&amp;quot; to hostname_of_new_exec&lt;br /&gt;
  qconf -as hostname&lt;br /&gt;
&lt;br /&gt;
HOW TO EDIT THE NUMBER OF SLOTS FOR A EXEC_HOST:&lt;br /&gt;
 qconf -mattr exechost complex_values slots=32 raiders.c.uoft.bkslab.org&lt;br /&gt;
 &amp;quot;complex_values&amp;quot; of &amp;quot;exechost&amp;quot; is empty - Adding new element(s). &amp;lt;br /&amp;gt;&lt;br /&gt;
 root@pan.slot-27.rack-1.pharmacy.cluster.uoft.bkslab.org modified &amp;quot;raiders.c.uoft.bkslab.org&amp;quot; in exechost list&lt;br /&gt;
&lt;br /&gt;
HOW TO ADD A HOSTGROUP:&lt;br /&gt;
  qconf -ahgrp @custom &lt;br /&gt;
&lt;br /&gt;
ADD THE EXECHOST TO A HOSTGROUP:&lt;br /&gt;
  qconf -mhgrp @custom&lt;br /&gt;
  service sgemaster restart&lt;br /&gt;
  # Then back on the exec_host:&lt;br /&gt;
  service sge_execd start&lt;br /&gt;
&lt;br /&gt;
To suspend jobs you do:&lt;br /&gt;
 qmod -sj job_number&lt;br /&gt;
&lt;br /&gt;
To delete nodes I did the following:&lt;br /&gt;
 qconf -shgrpl  -&amp;gt; To see a list of host groups&lt;br /&gt;
 qconf -shgrp @HOST_GROUP_NAME  -&amp;gt; For each host group to see if the nodes you want to delete are listed&lt;br /&gt;
If it is listed then:&lt;br /&gt;
 qconf-mhgrp @HOST_GROUP_NAME -&amp;gt; Modify this file (delete the line with the node you want to delete).&lt;br /&gt;
Once you&amp;#039;ve deleted the node you want to delete from all the hostgroups:&lt;br /&gt;
 qconf -de node_you_want _to_delete &amp;gt;/dev/null&lt;br /&gt;
 qmod -de node_you_want _to_delete&lt;br /&gt;
&lt;br /&gt;
To alter the priority on all the jobs for a user:&lt;br /&gt;
 qstat -u user | cut -d &amp;#039; &amp;#039; -f2 &amp;gt;&amp;gt; some_file&lt;br /&gt;
Edit some_file and delete the first couple lines (the header lines)&lt;br /&gt;
 for OUTPUT in $`cat some_file`; do qalter -p 1022 $OUTPUT; done;&lt;br /&gt;
 Priorities are -1024 to 1023&lt;br /&gt;
&lt;br /&gt;
DEBUGGING SGE:&lt;br /&gt;
 qstat -explain a&lt;br /&gt;
 for HOSTGROUP in `qconf -shgrpl`; do for HOSTLIST in `qconf -shgrp $HOSTGROUP`; do  echo $HOSTLIST; done; done | grep node-1.slot-27.rack-2.pharmacy.cluster.uoft.bkslab.org&lt;br /&gt;
&lt;br /&gt;
Look at the logs for both master and exec &lt;br /&gt;
(raiders:/var/spool/gridengine/raiders/messages and pan:/var/spool/gridengine/bkslab/qmaster/messages)&lt;br /&gt;
&lt;br /&gt;
Make sure resolv.conf looks like this:&lt;br /&gt;
 nameserver 142.150.250.10&lt;br /&gt;
 nameserver 10.10.16.64&lt;br /&gt;
 search cluster.uoft.bkslab.org uoft.bkslab.org bkslab.org                                                  &lt;br /&gt;
&lt;br /&gt;
 [root@pan ~]# for X in $`qconf -shgrpl`; do qconf -shgrp $X; done;&lt;br /&gt;
 Host group &amp;quot;$@24-core&amp;quot; does not exist&lt;br /&gt;
 group_name @64-core&lt;br /&gt;
 hostlist node-26.rack-2.pharmacy.cluster.uoft.bkslab.org&lt;br /&gt;
 group_name @8-core&lt;br /&gt;
 hostlist node-2.slot-27.rack-1.pharmacy.cluster.uoft.bkslab.org \&lt;br /&gt;
         node-1.slot-27.rack-1.pharmacy.cluster.uoft.bkslab.org&lt;br /&gt;
 group_name @allhosts&lt;br /&gt;
 hostlist @physical @virtual&lt;br /&gt;
 group_name @physical&lt;br /&gt;
 hostlist node-26.rack-2.pharmacy.cluster.uoft.bkslab.org&lt;br /&gt;
 group_name @virtual&lt;br /&gt;
 hostlist node-2.slot-27.rack-1.pharmacy.cluster.uoft.bkslab.org \&lt;br /&gt;
         node-1.slot-27.rack-1.pharmacy.cluster.uoft.bkslab.org&lt;br /&gt;
&lt;br /&gt;
1)  In one screen I would type strace qstat -f and then in the other screen I would type ps -ax | grep qstat to get the pid.  &amp;lt;br /&amp;gt;&lt;br /&gt;
Then ls -l /proc/pid/fd/ &amp;lt;br /&amp;gt;&lt;br /&gt;
I did this because when I typed strace qstat -f everytime it would get stuck saying this:&lt;br /&gt;
 poll([{fd=3, events=POLLIN|POLLPRI}], 1, 1000) = 0 (Timeout)&lt;br /&gt;
 gettimeofday({1390262563, 742705}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 742741}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 742771}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 742801}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 742828}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 742855}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 742881}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 742909}, NULL) = 0&lt;br /&gt;
&lt;br /&gt;
and then eventually it would say this:&lt;br /&gt;
 poll([{fd=3, events=POLLIN|POLLPRI}], 1, 1000) = 1 ([{fd=3, revents=POLLIN}])&lt;br /&gt;
 gettimeofday({1390262563, 960292}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 960321}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 960349}, NULL) = 0&lt;br /&gt;
 &amp;lt;nowiki&amp;gt;read(3, &amp;quot;&amp;lt;gmsh&amp;gt;&amp;lt;dl&amp;gt;99&amp;lt;/dl&amp;gt;&amp;lt;/gms&amp;quot;, 22)   = 22&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
 read(3, &amp;quot;h&amp;quot;, 1)                         = 1&lt;br /&gt;
 read(3, &amp;quot;&amp;gt;&amp;quot;, 1)                         = 1&lt;br /&gt;
 read(3, &amp;quot;&amp;lt;mih version=\&amp;quot;0.1\&amp;quot;&amp;gt;&amp;lt;mid&amp;gt;2&amp;lt;/mid&amp;gt;&amp;lt;&amp;quot;..., 99) = 99&lt;br /&gt;
 read(3, &amp;quot;&amp;lt;ccrm version=\&amp;quot;0.1\&amp;quot;&amp;gt;&amp;lt;/ccrm&amp;gt;&amp;quot;, 27) = 27&lt;br /&gt;
 gettimeofday({1390262563, 960547}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 960681}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 960709}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 960741}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 960769}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 960797}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 960823}, NULL) = 0&lt;br /&gt;
 shutdown(3, 2 /* send and receive */)   = 0&lt;br /&gt;
 close(3)                                = 0&lt;br /&gt;
 gettimeofday({1390262563, 961009}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 961036}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 961064}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 961093}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 961120}, NULL) = 0&lt;br /&gt;
 gettimeofday({1390262563, 961148}, NULL) = 0 &lt;br /&gt;
&lt;br /&gt;
The thing that is weird about this is when I typed ls -l /proc/pid/fd/ there was never a file descriptor &amp;quot;3&amp;quot;&lt;br /&gt;
&lt;br /&gt;
2) I tried to delete the nodes that we moved to SF by doing the following:&lt;br /&gt;
 qconf -dattr @physical &amp;quot;node-1.rack-3.pharmacy.cluster.uoft.bkslab.org node-10.rack-3.pharmacy.cluster.uoft.bkslab.org node-11.rack-3.pharmacy.cluster.uoft.bkslab.org node-12.rack-3.pharmacy.cluster.uoft.bkslab.org  node-13.rack-3.pharmacy.cluster.uoft.bkslab.org node-14.rack-3.pharmacy.cluster.uoft.bkslab.org node-15.rack-3.pharmacy.cluster.uoft.bkslab.org node-2.rack-3.pharmacy.cluster.uoft.bkslab.org node-26.rack-3.pharmacy.cluster.uoft.bkslab.org node-27.rack-3.pharmacy.cluster.uoft.bkslab.org node-29.rack-3.pharmacy.cluster.uoft.bkslab.org node-3.rack-3.pharmacy.cluster.uoft.bkslab.org node-4.rack-3.pharmacy.cluster.uoft.bkslab.org node-5.rack-3.pharmacy.cluster.uoft.bkslab.org node-6.rack-3.pharmacy.cluster.uoft.bkslab.org node-7.rack-3.pharmacy.cluster.uoft.bkslab.org node-8.rack-3.pharmacy.cluster.uoft.bkslab.org node-9.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; node-1.rack-3.pharmacy.cluster.uoft.bkslab.org @physical &amp;gt; /dev/null&lt;br /&gt;
&lt;br /&gt;
I would get the error: &lt;br /&gt;
 Modification of object &amp;quot;@physical&amp;quot; not supported&lt;br /&gt;
&lt;br /&gt;
3) I tried to see the queues complex attributes by typing qconf -sc and saw this:&lt;br /&gt;
 #name       shortcut   type        relop requestable consumable default  urgency &lt;br /&gt;
 slots               s          INT         &amp;lt;=        YES         YES            1        1000&lt;br /&gt;
&lt;br /&gt;
I am not quite sure what urgency = 1000 means.&lt;br /&gt;
All other names had &amp;quot;0&amp;quot; under urgency.&lt;br /&gt;
&lt;br /&gt;
4) I tried qmod -cq &amp;#039;*&amp;#039;  to clear the error state of all the queues.  &lt;br /&gt;
It would tell me this:&lt;br /&gt;
&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-1.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-1.slot-27.rack-1.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-1.slot-27.rack-2.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-10.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-11.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-12.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-13.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-14.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-15.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-2.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-2.slot-27.rack-1.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-2.slot-27.rack-2.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-26.rack-2.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-26.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-27.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-29.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-3.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-3.slot-27.rack-2.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-4.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-4.slot-27.rack-2.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-5.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-5.slot-27.rack-2.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-6.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-6.slot-27.rack-2.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-7.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-7.slot-27.rack-2.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-8.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
 Queue instance &amp;quot;all.q@node-9.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot; is already in the specified state: no error&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
5) I tried deleting a node like this instead:&lt;br /&gt;
 qconf -ds node-1.rack-3.pharmacy.cluster.uoft.bkslab.org&lt;br /&gt;
But when I typed qconf -sel it was still there.&lt;br /&gt;
&lt;br /&gt;
6)  I tried to see what the hostlist for @physical was by typing qconf -ahgrp @physical.  It said: group_name @physical, hostlist NONE&lt;br /&gt;
Then I typed qconf -shgrpl to see a list of all hostgroups and tried typing qconf -ahgrp.  All of them said the hostlist was NONE, &lt;br /&gt;
but when I tried to type qconf -ahgrp @allhosts I got this message:&lt;br /&gt;
 denied: &amp;quot;root&amp;quot; must be manager for this operation&lt;br /&gt;
 error: commlib error: got select error (Connection reset by peer)&lt;br /&gt;
&lt;br /&gt;
7) I looked at the messages in the file: /var/spool/gridengine/bkslab/qmaster/messages and it said this (over and over again):&lt;br /&gt;
&lt;br /&gt;
 01/20/2014 19:41:35|listen|pan|E|commlib error: got read error (closing &amp;quot;pan.slot-27.rack-1.pharmacy.cluster.uoft.bkslab.org/qconf/2&amp;quot;)&lt;br /&gt;
 01/20/2014 19:43:24|  main|pan|W|local configuration pan.slot-27.rack-1.pharmacy.cluster.uoft.bkslab.org not defined - using global configuration&lt;br /&gt;
 01/20/2014 19:43:24|  main|pan|W|can&amp;#039;t resolve host name &amp;quot;node-3-3.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot;: undefined commlib error code&lt;br /&gt;
 01/20/2014 19:43:24|  main|pan|W|can&amp;#039;t resolve host name &amp;quot;node-3-4.rack-3.pharmacy.cluster.uoft.bkslab.org&amp;quot;: undefined commlib error code&lt;br /&gt;
 01/20/2014 19:43:53|  main|pan|I|read job database with 468604 entries in 29 seconds&lt;br /&gt;
 01/20/2014 19:43:55|  main|pan|I|qmaster hard descriptor limit is set to 8192&lt;br /&gt;
 01/20/2014 19:43:55|  main|pan|I|qmaster soft descriptor limit is set to 8192&lt;br /&gt;
 01/20/2014 19:43:55|  main|pan|I|qmaster will use max. 8172 file descriptors for communication&lt;br /&gt;
 01/20/2014 19:43:55|  main|pan|I|qmaster will accept max. 99 dynamic event clients&lt;br /&gt;
 01/20/2014 19:43:55|  main|pan|I|starting up GE 6.2u5p3 (lx26-amd64)&lt;br /&gt;
&lt;br /&gt;
8)  Periodically i would get this error:  &lt;br /&gt;
 ERROR: failed receiving gdi request response for mid=3 (got no message).&lt;br /&gt;
&lt;br /&gt;
9)  I also tried delete the pid in the file: /var/spool/gridengine/bkslab/qmaster/qmaster.pid&lt;br /&gt;
That didn&amp;#039;t do anything.  It eventually just replaced it with a different number. &lt;br /&gt;
It&amp;#039;s weird because it&amp;#039;s not even the right pid.  For example the real pid was 8286 and the pid in the file was 8203:&lt;br /&gt;
&lt;br /&gt;
 [root@pan qmaster]# service sgemaster start&lt;br /&gt;
 Starting sge_qmaster:                                      [  OK  ]&lt;br /&gt;
 [root@pan qmaster]# ps -ax |grep sge&lt;br /&gt;
 Warning: bad syntax, perhaps a bogus &amp;#039;-&amp;#039;? See /usr/share/doc/procps-3.2.8/FAQ&lt;br /&gt;
 8286 ?        Rl     0:03 /usr/bin/sge_qmaster&lt;br /&gt;
 8301 pts/0    S+     0:00 grep sge&lt;br /&gt;
 [root@pan qmaster]# cat qmaster.pid &lt;br /&gt;
 8203&lt;br /&gt;
&lt;br /&gt;
10)   When I typed tail /var/log/messages I saw this:&lt;br /&gt;
 Jan 20 14:25:05 pan puppet-agent[2021]: Could not request certificate: Connection refused - connect(2)&lt;br /&gt;
 Jan 20 14:27:05 pan puppet-agent[2021]: Could not request certificate: Connection refused - connect(2)&lt;br /&gt;
 Jan 20 14:29:05 pan puppet-agent[2021]: Could not request certificate: Connection refused - connect(2)&lt;br /&gt;
 Jan 20 14:31:05 pan puppet-agent[2021]: Could not request certificate: Connection refused - connect(2)&lt;br /&gt;
 Jan 20 14:33:06 pan puppet-agent[2021]: Could not request certificate: Connection refused - connect(2)&lt;br /&gt;
 Jan 20 14:35:06 pan puppet-agent[2021]: Could not request certificate: Connection refused - connect(2)&lt;br /&gt;
 Jan 20 14:36:29 pan kernel: Registering the id_resolver key type&lt;br /&gt;
 Jan 20 14:36:29 pan kernel: FS-Cache: Netfs &amp;#039;nfs&amp;#039; registered for caching&lt;br /&gt;
 Jan 20 14:36:29 pan nfsidmap[2536]: nss_getpwnam: name &amp;#039;root@rack-1.pharmacy.cluster.uoft.bkslab.org&amp;#039; does not map into domain &amp;#039;uoft.bkslab.org&amp;#039;&lt;br /&gt;
 Jan 20 14:37:06 pan puppet-agent[2021]: Could not request certificate: Connection refused - connect(2)&lt;br /&gt;
This was what happened when I restarted the machine.&lt;/div&gt;</summary>
		<author><name>Benrwong</name></author>
	</entry>
</feed>